Configurable IC having a routing fabric with storage elements

ABSTRACT

Some embodiments provide a configurable IC that includes a configurable routing fabric with storage elements. In some embodiments, the routing fabric provides a communication pathway that routes signals to and from source and destination components. The routing fabric of some embodiments provides the ability to selectively store the signals passing through the routing fabric within the storage elements of the routing fabric. In this manner, a source or destination component continually performs operations (e.g., computational or routing) irrespective of whether a previous signal from or to such a component is stored within the routing fabric. The source and destination components include configurable logic circuits, configurable interconnect circuits, and various other circuits that receive or distribute signals throughout the configurable IC.

CLAIM OF BENEFIT TO PRIOR APPLICATION

This application claims benefit to U.S. Provisional Patent Application60/895,946, filed Mar. 20, 2007 and the U.S. Provisional PatentApplication 60/915,108, filed Apr. 30, 2007. These United StatesProvisional patent applications are incorporated herein by reference.

CROSS REFERENCE TO RELATED APPLICATIONS

This Application is related to the following applications with the samefiling date: U.S. patent application Ser. No. 11/754,299, filed May 27,2007; and U.S. patent application Ser. No. 11/754,300, filed May 27,2007.

FIELD OF THE INVENTION

The present invention is directed towards configurable ICs having arouting fabric with storage elements for performing routing and storageoperations.

BACKGROUND

The use of configurable integrated circuits (“ICs”) has dramaticallyincreased in recent years. One example of a configurable IC is a fieldprogrammable gate array (“FPGA”). An FPGA is a field programmable ICthat often has logic circuits, interconnect circuits, and input/output(I/O) circuits. The logic circuits (also called logic blocks) aretypically arranged as an internal array of circuits. These logiccircuits are typically connected together through numerous interconnectcircuits (also called interconnects). The logic and interconnectcircuits are often surrounded by the I/O circuits.

FIG. 1 illustrates an example of a configurable logic circuit 100. Thislogic circuit can be configured to perform a number of differentfunctions. As shown in FIG. 1, the logic circuit 100 receives a set ofinput data 105 and a set of configuration data 110. The configurationdata set is stored in a set of SRAM cells 115. From the set of functionsthat the logic circuit 100 can perform, the configuration data setspecifies a particular function that this circuit has to perform on theinput data set. Once the logic circuit performs its function on theinput data set, it provides the output of this function on a set ofoutput lines 120. The logic circuit 100 is said to be configurable, asthe configuration data set “configures” the logic circuit to perform aparticular function, and this configuration data set can be modified bywriting new data in the SRAM cells. Multiplexers and look-up tables aretwo examples of configurable logic circuits.

FIG. 2 illustrates an example of a configurable interconnect circuit200. This interconnect circuit 200 connects a set of input data 205 to aset of output data 210. This circuit receives configuration data 215that are stored in a set of SRAM cells 220. The configuration dataspecify how the interconnect circuit should connect the input data setto the output data set. The interconnect circuit 200 is said to beconfigurable, as the configuration data set “configures” theinterconnect circuit to use a particular connection scheme that connectsthe input data set to the output data set in a desired manner. Moreover,this configuration data set can be modified by writing new data in theSRAM cells. Multiplexers are one example of interconnect circuits.

FIG. 3A illustrates a portion of a prior art configurable IC 300. Asshown in this figure, the IC 300 includes an array of configurable logiccircuits 305 and configurable interconnect circuits 310. The IC 300 hastwo types of interconnect circuits 310 a and 310 b. Interconnectcircuits 310 a connect interconnect circuits 310 b and logic circuits305, while interconnect circuits 310 b connect interconnect circuits 310a to other interconnect circuits 310 a.

In some cases, the IC 300 includes numerous logic circuits 305 andinterconnect circuits 310 (e.g., hundreds, thousands, hundreds ofthousands, etc. of such circuits). As shown in FIG. 3A, each logiccircuit 305 includes additional logic and interconnect circuits.Specifically, FIG. 3A illustrates a logic circuit 305 a that includestwo sections 315 a that together are called a slice. Each sectionincludes a look-up table (LUT) 320, a user register 325, a multiplexer330, and possibly other circuitry (e.g., carry logic) not illustrated inFIG. 3A.

The multiplexer 330 is responsible for selecting between the output ofthe LUT 320 or the user register 325. For instance, when the logiccircuit 305 a has to perform a computation through the LUT 320, themultiplexer 330 selects the output of the LUT 320. Alternatively, thismultiplexer selects the output of the user register 325 when the logiccircuit 305 a or a slice of this circuit needs to store data for afuture computation of the logic circuit 305 a or another logic circuit.

FIG. 3B illustrates an alternative way of constructing half a slice in alogic circuit 305 a of FIG. 3A. Like the half-slice 315 a in FIG. 3A,the half-slice 315 b in FIG. 3B includes a look-up table (LUT) 320, auser register 325, a multiplexer 330, and possibly other circuitry(e.g., carry logic) not illustrated in FIG. 3B. However, in thehalf-slice 315 b, the user register 325 can also be configured as alatch. In addition, the half-slice 315 b also includes a multiplexer350. In half-slice 315 b, the multiplexer 350 receives the output of theLUT 320 instead of the register/latch 325, which receives this output inhalf-slice 315 a. The multiplexer 350 also receives a signal fromoutside of the half-slice 315 b. Based on its select signal, themultiplexer 350 then supplies one of the two signals that it receives tothe register/latch 325. In this manner, the register/latch 325 can beused to store (1) the output signal of the LUT 320 or (2) a signal fromoutside the half-slice 315 b.

The use of user registers to store such data is at times undesirable, asit typically requires data to be passed at a clock's rising edge or aclock's fall edge. In other words, registers often do not provideflexible control over the data passing between the various circuits ofthe configurable IC. In addition, the placement of a register or a latchin the logic circuit increases the signal delay through the logiccircuit, as it requires the use of at least one multiplexer 330 toselect between the output of a register/latch 325 and the output of aLUT 320. The placement of a register or a latch in the logic circuitfurther hinders the design of an IC as the logic circuit becomesrestricted to performing either storage operations or logic operations,but not both.

Accordingly, there is a need for a configurable IC that has a moreflexible approach for storing data and passing data. More generally,there is a need for more flexible storage mechanisms in configurableICs.

SUMMARY OF THE INVENTION

Some embodiments provide a configurable IC that includes a configurablerouting fabric with storage elements. In some embodiments, the routingfabric provides a communication pathway that routes signals to and fromsource and destination components. The routing fabric of someembodiments provides the ability to selectively store the signalspassing through the routing fabric within the storage elements of therouting fabric. In this manner, a source or destination componentcontinually performs operations (e.g., computational or routing)irrespective of whether a previous signal from or to such a component isstored within the routing fabric. The source and destination componentsinclude configurable logic circuits, configurable interconnect circuits,and various other circuits that receive or distribute signals throughoutthe configurable IC.

In some embodiments, the routing fabric includes configurableinterconnect circuits, the wire segments (e.g., the metal or polysiliconsegments) that connect to the interconnect circuits, and vias thatconnect to these wire segments and to the terminals of the interconnectcircuits. In some of these embodiments, the routing fabric also includesbuffers for achieving one or more objectives (e.g., maintaining thesignal strength, reducing noise, altering signal delay, etc.) visa vithe signals passing along the wire segments. In conjunction with orinstead of these buffer circuits, the routing fabric of some embodimentsmight also include one or more non-configurable circuits (e.g.,non-configurable interconnect circuits).

Different embodiments place storage elements at different locations inthe routing fabric. Examples of such locations include storage elementscoupled to or within the output stage of interconnect circuits, storageelements coupled to, cross-coupled to, or adjacent to buffer circuits inthe routing fabric, and storage elements at other locations of therouting fabric.

For instance, in some embodiments, the routing fabric includes aparallel distributed path (PDP) for an output of a source component thatis being routed through the routing fabric to an input of a destinationcomponent. A PDP includes a first path and a second path. The first pathdirectly routes the output of the source to a first input of thedestination, while the second path runs in parallel with the first pathand passes the output of the source through a controllable storageelement before reaching a second input of the destination. The storageelement stores the output value of the source circuit when enabled. Inaddition to reaching the same destination component, some embodimentsallow the second path to fan out to other destination components thanthe first path. In some embodiments, both the first and second paths ofa PDP emanate from the output of an interconnect circuit that receivesthe output of the source component.

In some embodiments, the routing fabric includes interconnect circuitswith storage elements located at their output stage. For a particularinterconnect circuit that connects a particular source circuit to aparticular destination circuit, the output of the particularinterconnect circuit's storage element connects to an input of thedestination circuit. When enabled, this storage holds the output of thesource circuit for a particular duration (e.g., for one or more userdesign clock cycles or one or more sub-cycles). Typically, such astorage element is used to store data for a relatively small amount oftime as its storage operation prevents the interconnect circuit fromperforming its routing operation. Accordingly, at times, this storageelement is referred to below as a short-term storage element.

In addition to placing a short-term storage element at the output stageof an interconnect circuit, some embodiments place a “long-term” storageelement in a feedback path between an output and input of theinterconnect circuit. Such a storage element is referred to as along-term storage element as it can be used to store data for a timeduration that can be relatively long as the storage element does notdisable the interconnect circuit's routing operation. In other words,the placement of the storage element in a feedback path of theinterconnect circuit allows the interconnect circuit to continueperforming its routing operations even when the storage element storesdata. In some embodiments, either the short term or long term storageelement of an interconnect circuit is performing a storage operation atany given time.

Some embodiments place the long-term storage element and the feedbackpath in series with the short term storage element. For instance, insome embodiments, the output of the interconnect circuit that passesthrough the short term storage element (1) is distributed to adestination component and (2) is distributed along the feedback paththrough the long term storage element to an input of the interconnectcircuit.

Other embodiments position the long-term storage element and thefeedback path in parallel with the short-term storage element. Forinstance, the output of the interconnect circuit can be distributedalong two separate output paths. The first output path passes the outputof the interconnect circuit through the short-term storage beforereaching the input of a destination circuit (where in some embodimentsthis path reaches the destination circuit's input possibly through oneor more wire segments, vias, and buffers). The second parallel outputpath passes the output of the interconnect circuit through the long-termstorage element along the feedback path before passing this output backto an input of the interconnect circuit.

Some embodiments do not utilize any short-term storage at the output ofan interconnect circuit, but only utilize a long-term storage in afeedback path between the output and input of an interconnect circuit.Other embodiments utilize a long-term storage that receives the outputof an interconnect circuit but does not supply its output back to thesame interconnect circuit. Instead, this long-term storage routes itsoutput to an input of another interconnect circuit.

The PDP, short-term, and long-term storage elements are controllablestorage elements that can controllably store data for arbitrarydurations of time. In some embodiments, some or all of these storageelements are controlled by user design signals. In some embodiments,some or all of these storage elements are configurable storage elementswhose storage operation is at least partly controlled by a set ofconfiguration data stored in configuration data storage of the IC. Forinstance, in some embodiments, the set of configuration bits determinesthe clock cycles in which a PDP, short-term, or long-term storageelement receives and stores data.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the invention are set forth in the appendedclaims. However, for the purpose of explanation, several embodiments ofthe invention are set forth in the following figures.

FIG. 1 illustrates an example of a configurable logic circuit.

FIG. 2 illustrates an example of a configurable interconnect circuit.

FIG. 3A illustrates a portion of a prior art configurable IC.

FIG. 3B illustrates an alternative way of constructing half a slice in alogic circuit of FIG. 3A.

FIG. 4 illustrates a configurable circuit architecture that is formed bynumerous configurable tiles that are arranged in an array with multiplerows and columns.

FIG. 5 provides one possible physical architecture of the configurableIC illustrated in FIG. 4.

FIG. 6 illustrates the detailed tile arrangement of some embodiments.

FIG. 7 illustrates an example of a sub-cycle reconfigurable IC.

FIG. 8 provides an illustrative embodiment of the functionality providedby placing storage elements within the routing fabric of a configurableIC.

FIG. 9 illustrates placement of a storage element within the routingfabric of a configurable IC.

FIG. 10 illustrates a circuit representation of a storage circuit.

FIG. 11 illustrates another alternative implementation of a storagecircuit.

FIG. 12 illustrates an implementation of a storage circuit within therouting fabric.

FIG. 13A illustrates a storage circuit with a parallel distributedoutput path for providing simultaneous routing and storage capability atthe interconnect.

FIG. 13B illustrates a storage circuit with a parallel distributed inwhich the parallel path is distributed to multiple destinations.

FIG. 14 illustrates a circuit for generating a parallel distributedoutput path.

FIG. 15 illustrates a cross-coupling transistor storage element.

FIG. 16A illustrates a circuit representation for a first tri-stateinverter of FIG. 15.

FIG. 16B illustrates a circuit representation for a second tri-stateinverter of FIG. 15.

FIG. 17 illustrates a storage element within the routing fabric with afeedback path connected in series to the output of a routing circuit.

FIG. 18 illustrates an embodiment for the circuit of FIG. 17.

FIG. 19 presents an alternative placement for the storage element of thestorage circuit of FIG. 18.

FIG. 20 illustrates a storage element within the routing fabric with afeedback path connected in parallel to the output of a routing circuit.

FIG. 21 illustrates an embodiment for the circuit of FIG. 20.

FIG. 22 present a circuit representation for a multiplexer containing aparallel set of complementary outputs.

FIG. 23 presents an alternative placement for the storage element of thestorage circuit of FIG. 21.

FIG. 24A illustrates a storage element within the routing fabric with afeedback path connected in series to the output of a routing circuit.

FIG. 24B illustrates a storage element within the routing fabric with afeedback path connected in parallel to the output of a routing circuit.

FIG. 25 illustrates a pair of storage elements connected to the outputstage of a routing circuit.

FIG. 26 illustrates a pair of storage elements along a paralleldistributed output path.

FIG. 27 illustrates using multiple storage elements within the routingfabric for providing long term storage.

FIG. 28 provides an illustrative embodiment of the functionalityprovided by placing storage elements within the routing fabric.

FIG. 29 illustrates an alternative placement and use of multiple storageelements within the routing fabric to provide long term storage.

FIG. 30 illustrates a portion of a configurable IC of some embodimentsof the invention.

FIG. 31 illustrates a more detailed example of data between aconfigurable node and a configurable circuit arrangement that includesconfiguration data that configure the nodes to perform particularoperations.

FIG. 32 illustrates a system on chip (“SoC”) implementation of aconfigurable IC.

FIG. 33 illustrates an embodiment that employs a system in package(“SiP”) implementation for a configurable IC.

FIG. 34 conceptually illustrates a more detailed example of a computingsystem that has an IC, which includes one of the invention'sconfigurable circuit arrangements.

DETAILED DESCRIPTION

In the following description, numerous details are set forth for purposeof explanation. However, one of ordinary skill in the art will realizethat the invention may be practiced without the use of these specificdetails. For instance, not all embodiments of the invention need to bepracticed with the specific number of bits and/or specific devices(e.g., multiplexers) referred to below. In other instances, well-knownstructures and devices are shown in block diagram form in order not toobscure the description of the invention with unnecessary detail.

I. Overview

Some embodiments provide a configurable IC that includes a configurablerouting fabric with storage elements. In some embodiments, the routingfabric provides a communication pathway that routes signals to and fromsource and destination components. The routing fabric of someembodiments provides the ability to selectively store the signalspassing through the routing fabric within the storage elements of therouting fabric. In this manner, a source or destination componentcontinually performs operations (e.g., computational or routing)irrespective of whether a previous signal from or to such a component isstored within the routing fabric. The source and destination componentsinclude configurable logic circuits, configurable interconnect circuits,and various other circuits that receive or distribute signals throughoutthe configurable IC.

In some embodiments, the routing fabric includes routing circuits, thewire segments (e.g., the metal or polysilicon segments) that connect tothe routing circuits, and vias that connect to these wire segments andto the terminals of the routing circuits. In some of these embodiments,the routing fabric also includes buffers for achieving one or moreobjectives (e.g., maintaining the signal strength, reducing noise,altering signal delay, etc.) visa vi the signals passing along the wiresegments. In conjunction with or instead of these buffer circuits, therouting fabric of some embodiments might also include one or morenon-configurable circuits (e.g., non-configurable interconnectcircuits).

Different embodiments place storage elements at different locations inthe routing fabric. Examples of such locations include storage elementscoupled to or within the output stage of routing circuits, storageelements coupled to, cross-coupled to, or adjacent to buffer circuits inthe routing fabric, and storage elements at other locations of therouting fabric.

For instance, in some embodiments, the routing fabric includes aparallel distributed path (PDP) for an output of a source component thatis being routed through the routing fabric to an input of a destinationcomponent. A PDP includes a first path and a second path. The first pathdirectly routes the output of the source to a first input of thedestination, while the second path runs in parallel with the first pathand passes the output of the source through a controllable storageelement before reaching a second input of the destination. The storageelement stores the output value of the source circuit when enabled. Inaddition to reaching the same destination component, some embodimentsallow the second path to fan out to other destination components thanthe first path. In some embodiments, both the first and second paths ofa PDP emanate from the output of a routing circuit that receives theoutput of the source component.

In some embodiments, the routing fabric includes routing circuits withstorage elements located at their output stage. For a particular routingcircuit that connects a particular source circuit to a particulardestination circuit, the output of the particular routing circuit'sstorage element connects to an input of the destination circuit. Whenenabled, this storage holds the output of the source circuit for aparticular duration (e.g., for one or more user design clock cycles orone or more sub-cycles). Typically, such a storage element is used tostore data for a relatively small amount of time as its storageoperation prevents the routing circuit from performing its routingoperation. Accordingly, at times, this storage element is referred tobelow as a short-term storage element.

In addition to placing a short-term storage element at the output stageof a routing circuit, some embodiments place a “long-term” storageelement in a feedback path between an output and input of the routingcircuit. Such a storage element is referred to as a long-term storageelement as it can be used to store data for a time duration that can berelatively long as the storage element does not disable the routingcircuit's routing operation. In other words, the placement of thestorage element in a feedback path of the routing circuit allows therouting circuit to continue performing its routing operations even whenthe storage element stores data. In some embodiments, either the shortterm or long term storage element of an interconnect circuit isperforming a storage operation at any given time.

Some embodiments place the long-term storage element and the feedbackpath in series with the short term storage element. For instance, insome embodiments, the output of the routing circuit that passes throughthe short term storage element (1) is distributed to a destinationcomponent and (2) is distributed along the feedback path through thelong term storage element to an input of the routing circuit.

Other embodiments position the long-term storage element and thefeedback path in parallel with the short-term storage element. Forinstance, the output of the routing circuit can be distributed along twoseparate output paths. The first output path passes the output of therouting circuit through the short-term storage before reaching the inputof a destination circuit (where in some embodiments this path reachesthe destination circuit's input possibly through one or more wiresegments, vias, and buffers). The second parallel output path passes theoutput of the routing circuit through the long-term storage elementalong the feedback path before passing this output back to an input ofthe routing circuit.

Some embodiments do not utilize any short-term storage at the output ofa routing circuit, but only utilize a long-term storage in a feedbackpath between the output and input of a routing circuit. Otherembodiments utilize a long-term storage that receives the output of arouting circuit but does not supply its output back to the same routingcircuit. Instead, this long-term storage routes its output to an inputof another routing circuit.

The PDP, short-term, and long-term storage elements are controllablestorage elements that can controllably store data for arbitrarydurations of time. In some embodiments, some or all of these storageelements are controlled by user design signals. In some embodiments,some or all of these storage elements are configurable storage elementswhose storage operation is at least partly controlled by a set ofconfiguration data stored in configuration data storage of the IC. Forinstance, in some embodiments, the set of configuration data determinesthe clock cycles or sub-cycles in which a PDP, short-term, or long-termstorage element receives and stores data.

Some embodiments implement the storage elements described above usingregisters for all of the storage elements. Other embodiments use latchesfor some or all the storage elements. In some situations, latches haveseveral advantages over registers. For instance, registers are edgetriggered, i.e., their operation is driven by the rising or falling edgeof a user design clock cycle or sub-cycle. This limitation on theiroperation imposes an arbitrary temporal restriction on when data can bepassed between a register and other circuits. Latches, on the otherhand, do not suffer from such arbitrary constraints as they can operatesolely in response to an enable signal. Hence, they can typicallyoperate transparently in response to enable signals that can even beasynchronous. This ability to operate transparently allows theoperations of the latches to adjust flexibly to receive and output datawhenever such data is provided or needed.

Some embodiments use complementary pass logic to implement some or allof their circuits. Some of these embodiments use a set of cross-couplingtransistors to form some or all the storage elements. Cross-couplingtransistors remove the signal delay associated with traditional storageelements such as registers or latches. Also, cross-coupling transistorsoperate solely in response to an enable signal and therefore allow thestorage elements to operate transparently in response to the enablesignal.

Several more detailed embodiments of the invention are described in thesections below. Before describing these embodiments further, an overviewof the configurable IC architecture used by some embodiments toimplement the routing fabric with storage elements is given in SectionII below. This discussion is followed by the discussion in Section IIIof an overview of the reconfigurable IC architecture used by someembodiments to implement the routing fabric with storage elements. Next,Section IV describes various implementations of a configurable IC thatincludes storage elements in its routing fabric. Last, Section Vdescribes an electronics system that has an IC which implements some ofthe embodiments of the invention.

II. Configurable IC Architecture

An IC is a device that includes numerous electronic components (e.g.,transistors, resistors, diodes, etc.) that are embedded typically on thesame substrate, such as a single piece of semiconductor wafer. Thesecomponents are connected with one or more layers of wiring to formmultiple circuits, such as Boolean gates, memory cells, arithmeticunits, controllers, decoders, etc. An IC is often packaged as a singleIC chip in one IC package, although some IC chip packages can includemultiple pieces of substrate or wafer.

A configurable IC is an integrated circuit (IC) that has configurablecircuits. A configurable circuit is a circuit that can “configurably”perform a set of operations. Specifically, a configurable circuitreceives a configuration data set that specifies the operation that theconfigurable circuit has to perform in the set of operations that it canperform. In some embodiments, configuration data is generated outside ofthe configurable IC. In these embodiments, a set of software toolstypically converts a high-level IC design (e.g., a circuitrepresentation or a hardware description language design) into a set ofconfiguration data bits that can configure the configurable IC (or moreaccurately, the configurable ICs configurable circuits) to implement theIC design.

Examples of configurable circuits include configurable interconnectcircuits and configurable logic circuits. A logic circuit is a circuitthat can perform a function on a set of input data that it receives. Aconfigurable logic circuit is a logic circuit that can be configured toperform different functions on its input data set.

A configurable interconnect circuit is a circuit that can configurablyconnect an input set to an output set in a variety of manners. Aninterconnect circuit can connect two terminals or pass a signal from oneterminal to another by establishing an electrical path between theterminals. Alternatively, an interconnect circuit can establish aconnection or pass a signal between two terminals by having the value ofa signal that appears at one terminal appear at the other terminal. Inconnecting two terminals or passing a signal between two terminals, aninterconnect circuit in some embodiments might invert the signal (i.e.,might have the signal appearing at one terminal inverted by the time itappears at the other terminal). In other words, the interconnect circuitof some embodiments implements a logic inversion operation inconjunction to its connection operation. Other embodiments, however, donot build such an inversion operation in some or all of theirinterconnect circuits.

The configurable IC of some embodiments includes configurable logiccircuits and configurable interconnect circuits for routing the signalsto and from the configurable logic circuits. In addition to configurablecircuits, a configurable IC also typically includes non-configurablecircuits (e.g., non-configurable logic circuits, interconnect circuits,memories, etc.).

In some embodiments, the configurable circuits might be organized in anarrangement that has all the circuits organized in an array with severalaligned rows and columns. In addition, within such a circuit array, someembodiments disperse other circuits (e.g., memory blocks, processors,macro blocks, IP blocks, SERDES controllers, clock management units,etc.). FIGS. 4-6 illustrate several configurable circuitarrangements/architectures that include the invention's circuits. Onesuch architecture is illustrated in FIG. 4.

The architecture of FIG. 4 is formed by numerous configurable tiles 405that are arranged in an array with multiple rows and columns. In FIG. 4,each configurable tile includes a configurable three-input LUT 410,three configurable input-select multiplexers 415, 420, and 425, and twoconfigurable routing multiplexers 430 and 435. Different embodimentshave different number of configurable interconnect circuits 430. Forinstance, some embodiments may have eight configurable interconnectcircuits while others may have more or less such circuits. For eachconfigurable circuit, the configurable IC 400 includes a set of storageelements (e.g., a set of SRAM cells) for storing a set of configurationdata bits.

In some embodiments, the logic circuits are look-up tables (LUTs) whilethe interconnect circuits are multiplexers. Also, in some embodiments,the LUTs and the multiplexers are sub-cycle reconfigurable circuits. Insome of these embodiments, the configurable IC stores multiple sets ofconfiguration data for a sub-cycle reconfigurable circuit, so that thereconfigurable circuit can use a different set of configuration data indifferent sub-cycles. Other configurable tiles can include other typesof circuits, such as memory arrays instead of logic circuits.

In FIG. 4, an input-select multiplexer (also referred to as an IMUX) 415is an interconnect circuit associated with the LUT 410 that is in thesame tile as the input select multiplexer. One such input selectmultiplexer receives several input signals for its associated LUT andpasses one of these input signals to its associated LUT. In someembodiments, some of the input-select multiplexers are hybridinput-select/logic circuits (referred to as HMUXs) capable of performinglogic operations as well as functioning as input select multiplexers. AnHMUX is a multiplexer that can receive “user-design signals” along itsselect lines.

A user-design signal within a configurable IC is a signal that isgenerated by a circuit (e.g., logic circuit) of the configurable IC. Theword “user” in the term “user-design signal” connotes that the signal isa signal that the configurable IC generates for a particular applicationthat a user has configured the IC to perform. User-design signal isabbreviated to user signal in some of the discussion in this document.In some embodiments, a user signal is not a configuration or clocksignal that is generated by or supplied to the configurable IC. In someembodiments, a user signal is a signal that is a function of at least aportion of the set of configuration data received by the configurable ICand at least a portion of the inputs to the configurable IC. In theseembodiments, the user signal can also be dependent on (i.e., can also bea function of) the state of the configurable IC. The initial state of aconfigurable IC is a function of the set of configuration data receivedby the configurable IC and the inputs to the configurable IC. Subsequentstates of the configurable IC are functions of the set of configurationdata received by the configurable IC, the inputs to the configurable IC,and the prior states of the configurable IC.

In FIG. 4, a routing multiplexer (also referred to as an RMUX) 430 is aninterconnect circuit that at a macro level connects other logic and/orinterconnect circuits. In other words, unlike an input selectmultiplexer in these figures that only provides its output to a singlelogic circuit (i.e., that only has a fan out of 1), a routingmultiplexer in some embodiments either provides its output to severallogic and/or interconnect circuits (i.e., has a fan out greater than 1),or provides its output to at least one other interconnect circuit.

In some embodiments, the RMUXs depicted in FIG. 4 form the routingfabric along with the wire-segments that connect to the RMUXs, and thevias that connect to these wire segments and/or to the RMUXs. In someembodiments, the routing fabric further includes buffers for achievingone or more objectives (e.g., maintain the signal strength, reducenoise, alter signal delay, etc.) visa vi the signals passing along thewire segments.

Various wiring architectures can be used to connect the RMUXs, IMUXs,and LUTs. Several examples of the wire connection scheme are describedin U.S. application Ser. No. 11/082,193 entitled “Configurable IC withRouting Circuits with Offset Connections”, filed on Mar. 15, 2005.

Several embodiments are described below by reference to a “directconnection.” In some embodiments, a direct connection is establishedthrough a combination of one or more wire segments, and potentially oneor more vias, but no intervening circuit. In some embodiments, a directconnection might however include one or more intervening buffer circuitsbut no other type of intervening circuits. In yet other embodiments, adirect connection might include intervening non-configurable circuitsinstead of or in conjunction with buffer circuits. In some of theseembodiments, the intervening non-configurable circuits includeinterconnect circuits, while in other embodiments they do not includeinterconnect circuits.

In the discussion below, two circuits might be described as directlyconnected. This means that the circuits are connected through adirection connection. Also, some connections are referred to below asconfigurable connections and some circuits are described as configurablyconnected. Such references signifies that the circuits are connectedthrough a configurable interconnect circuit (such as a configurablerouting circuit).

In some embodiments, the examples illustrated in FIG. 4 represent theactual physical architecture of a configurable IC. However, in otherembodiments, the examples illustrated in FIG. 4 topologically illustratethe architecture of a configurable IC (i.e., they conceptually show theconfigurable IC without specifying a particular geometric layout for theposition of the circuits).

In some embodiments, the position and orientation of the circuits in theactual physical architecture of a configurable IC are different from theposition and orientation of the circuits in the topological architectureof the configurable IC. Accordingly, in these embodiments, the ICsphysical architecture appears quite different from its topologicalarchitecture. For example, FIG. 5 provides one possible physicalarchitecture of the configurable IC 400 illustrated in FIG. 4.

Having the aligned tile layout with the same circuit elements of FIG. 5simplifies the process for designing and fabricating the IC, as itallows the same circuit designs and mask patterns to be repetitivelyused to design and fabricate the IC. In some embodiments, the similaraligned tile layout not only has the same circuit elements but also havethe same exact internal wiring between their circuit elements. Havingsuch layout further simplifies the design and fabrication processes asit further simplifies the design and mask making processes.

Some embodiments might organize the configurable circuits in anarrangement that does not have all the circuits organized in an arraywith several aligned rows and columns. Therefore, some arrangements mayhave configurable circuits arranged in one or more arrays, while otherarrangements may not have the configurable circuits arranged in anarray.

Some embodiments might utilize alternative tile structures. Forinstance, FIG. 6 illustrates an alternative tile structure that is usedin some embodiments. This tile 600 has two sets 605 of 4-aligned LUTsalong with their associated IMUXs. It also includes six sets 610 ofRMUXs and five banks 615 of configuration RAM storage. Each 4-alignedLUT tile shares one carry chain. One example of which is described inU.S. application Ser. No. 11/082,193 entitled “Configurable IC withRouting Circuits with Offset Connections”, filed on Mar. 15, 2005. Oneof ordinary skill in the art would appreciate that other organizationsof LUT tiles may also be used in conjunction with the invention and thatthese organizations might have fewer or additional tiles.

III. Reconfigurable IC Architecture

Some embodiments of the invention can be implemented in a reconfigurableintegrated circuit that has reconfigurable circuits that reconfigure(i.e., base their operation on different sets of configuration data) oneor more times during the operation of the IC. Specifically,reconfigurable ICs are configurable ICs that can reconfigure duringruntime. A reconfigurable IC typically includes reconfigurable logiccircuits and/or reconfigurable interconnect circuits, where thereconfigurable logic and/or interconnect circuits are configurable logicand/or interconnect circuits that can “reconfigure” more than once atruntime. A configurable logic or interconnect circuit reconfigures whenit bases its operation on a different set of configuration data.

A reconfigurable circuit of some embodiments that operates on four setsof configuration data receives its four configuration data setssequentially in an order that loops from the first configuration dataset to the last configuration data set. Such a sequentialreconfiguration scheme is referred to as a 4 “loopered” scheme. Otherembodiments, however, might be implemented as six or eight looperedsub-cycle reconfigurable circuits. In a six or eight looperedreconfigurable circuit, a reconfigurable circuit receives six or eightconfiguration data sets in an order that loops from the lastconfiguration data set to the first configuration data set.

FIG. 7 conceptually illustrates an example of a sub-cycle reconfigurableIC (i.e., an IC that is reconfigurable on a sub-cycle basis). In thisexample, the sub-cycle reconfigurable IC implements an IC design 705that operates at a clock speed of X MHz. The operations performed by thecomponents in the IC design 705 can be partitioned into four sets ofoperations 720-735, with each set of operations being performed at aclock speed of X MHz.

FIG. 7 then illustrates that these four sets of operations 720-735 canbe performed by one sub-cycle reconfigurable IC 710 that operates at 4XMHz. In some embodiments, four cycles of the 4X MHz clock correspond tofour sub-cycles within a cycle of the X MHz clock. Accordingly, thisfigure illustrates the reconfigurable IC 710 reconfiguring four timesduring four cycles of the 4X MHz clock (i.e., during four sub-cycles ofthe X MHz clock). During each of these reconfigurations (i.e., duringeach sub-cycle), the reconfigurable IC 710 performs one of theidentified four sets of operations. In other words, the fasteroperational speed of the reconfigurable IC 710 allows this IC toreconfigure four times during each cycle of the X MHz clock, in order toperform the four sets of operations sequentially at a 4X MHz rateinstead of performing the four sets of operations in parallel at an XMHz rate.

IV. Storage Elements Within the Routing Fabric

As mentioned above, the configurable routing fabric of some embodimentsis formed by configurable RMUXs along with the wire-segments thatconnect to the RMUXs, vias that connect to these wire segments and/or tothe RMUXs, and buffers that buffer the signals passing along one or moreof the wire segments. In addition to these components, the routingfabric of some embodiments further includes configurable storageelements.

Having the storage elements within the routing fabric is highlyadvantageous. For instance, such storage elements obviate the need toroute data computed by a source component to a second component thatstores the computed data before routing the data to a destinationcomponent that will use the data. Instead, such computed data can bestored optimally within storage elements located along the routing pathsbetween source and destination components, which can be logic and/orinterconnect circuits within the IC.

Such storage functionality within the routing fabric is ideal when insome embodiments the destination component is unable to receive orprocess the signal from the source component during a certain timeperiod. This functionality is also useful in some embodiments when asignal from a source component has insufficient time to traverse thedefined route to reach the destination within a single clock cycle orsub-cycle and needs to be temporarily stored along the route beforereaching the destination in a later clock cycle (e.g., user-design clockcycle) or in a later sub-cycle in case of a sub-cycle reconfigurable IC.By providing storage within the routing fabric, the source anddestination components continue to perform operations (e.g.,computational or routing) during the required storage time period.

FIG. 8 provides an illustrative example of the functionality provided byplacing storage elements within the routing fabric of a configurable IC.In FIG. 8, a component 810 is outputting a signal for processing bycomponent 820 at clock cycle 1. However, component 820 is receiving asignal from component 830 at clock cycles 1 and 2 and a signal fromcomponent 840 at clock cycle 3. Therefore, the signal from 810 may notbe routed to 820 until clock cycle 4. Hence, the signal is stored withinthe storage element 850 located within the routing fabric. By storingthe signal from 810 within the routing fabric during clock cycles 1through 3, components 810 and 820 remain free to perform otheroperations during this time period. At clock cycle 4, 820 is ready toreceive the stored signal and therefore the storage element 850 releasesthe value. It should be apparent to one of ordinary skill in the artthat the clock cycles of some embodiments described above could beeither (1) sub-cycles within or between different user design clockcycles of a reconfigurable IC, (2) user-design clock cycles, or (3) anyother clock cycle.

FIG. 9 illustrates several examples of different types of controllablestorage elements 930-960 that can be located throughout the routingfabric 910 of a configurable IC. Each storage element 930-960 can becontrollably enabled to store an output signal from a source componentthat is to be routed through the routing fabric to some destinationcomponent. In some embodiments, some or all of these storage elementsare configurable storage elements whose storage operation is controlledby a set of configuration data stored in configuration data storage ofthe IC. U.S. patent application Ser. No. 11/081,859 describes atwo-tiered multiplexer structure for retrieving enable signals on asub-cycle basis from configuration data storage for a particularconfigurable storage. It also describes building the first tier of suchmultiplexers within the output circuitry of the configuration storagethat stores a set of configuration data. Such multiplexer circuitry canbe used in conjunction with the configurable storage elements describedabove and below. U.S. patent application Ser. No. 11/081,859 isincorporated herein by reference.

As illustrated in FIG. 9, outputs are generated from the circuitelements 920. The circuit elements 920 are configurable logic circuits(e.g., 3-input LUTs and their associated IMUXs as shown in expansion905), while they are other types of circuits in other embodiments. Insome embodiments, the outputs from the circuit elements 920 are routedthrough the routing fabric 910 where the outputs can be controllablystored within the storage elements 930-960 of the routing fabric.Storage element 930 is a storage element that is coupled to the outputof a routing multiplexer. This storage element will be further describedbelow by reference to FIGS. 10 and 11. Storage element 940 includes arouting circuit with a parallel distributed output path in which one ofthe parallel distributed paths contains a storage element. This storageelement will be further described below by reference to FIGS. 13A and13B. Storage elements 950 and 960 include a routing circuit with a setof storage elements in which a second storage element is connected inseries or in parallel to the output path of the routing circuit. Storageelement 950 will be further described below by reference to FIG. 17 andstorage element 960 by reference to FIG. 20.

One of ordinary skill in the art will realize that the depicted storageelements within the routing fabric sections of FIG. 9 only present someembodiments of the invention and do not include all possible variations.Some embodiments use all these types of storage elements, while otherembodiments do not use all these types of storage elements (e.g., useone or two of these types).

A. Storage Element at Output of a Routing Multiplexer

FIG. 10 illustrates a circuit representation of the storage element 930.In some embodiments, the storage element 930 is a latch 1005 that isbuilt in or placed at the output stage of a multiplexer 1010. The latch1005 receives a latch enable signal. When the latch enable signal isinactive, the circuit simply acts as a routing circuit. On the otherhand, when the latch enable signal is active, the circuit acts as alatch that outputs the value that the circuit was previously outputtingwhile serving as a routing circuit. Accordingly, when another circuit ina second later configuration cycle needs to receive the value of circuit1000 in a first earlier configuration cycle, the circuit 1000 can beused. The circuit 1000 may receive and latch the value in a cycle beforethe second later configuration cycle (e.g., in the first earlier cycle)and output the value to the second circuit in the second latersub-cycle.

FIG. 11 illustrates an implementation of the circuit 1000, where thelatch is built into the output stage of the multiplexer 1010 by using apair of cross-coupling transistors. As shown in this figure, the circuit1100 includes (1) one set of input buffers 1105, (2) three sets 1110,1115, and 1120 of NMOS pass gate transistors, (3) two pull-up PMOStransistors 1125 and 1130, (4) two inverting output buffers 1135 and1140, and (5) two cross-coupling transistors 1145 and 1150.

The circuit 1100 is an eight-to-one multiplexer that can also serve as alatch. The inclusions of the two transistors 1145 and 1150 that crosscouple the two output buffers 1135 and 1140 and the inclusion of theenable signal with a signal that drives the last set 1120 of the passtransistors of the eight-to-one multiplexer allow the eight-to-onemultiplexer 1100 to act as a storage element whenever the enable signalis active (which, in this case, means whenever the enable signal ishigh).

In a CPL implementation of a circuit, a complementary pair of signalsrepresents each logic signal, where an empty circle at or a bar over theinput or output of a circuit denotes the complementary input or outputof the circuit in the figures. In other words, the circuit receives trueand complement sets of input signals and provides true and complementsets of output signals. Accordingly, in the multiplexer 1100 of FIG. 11,one subset of the input buffers 1105 receives eight input bits (0-7),while another subset of the input buffers 1105 receives the complementof the eight inputs bits. These input buffers serve to buffer the firstset 1110 of pass transistors.

The first set 1110 of pass transistors receive the third select bit S2or the complement of this bit, while the second set 1115 of passtransistors receive the second select bit S1 or the complement of thisbit. The third set 1120 of pass transistors receive the first select bitor its complement after this bit has been “AND'ed” by the complement ofthe enable signal. When the enable bit is not active (i.e., in thiscase, when the enable bit is low), the three select bits S2, S1, and S0cause the pass transistors to operate to pass one of the input bits andthe complement of this input bit to two intermediate output nodes 1155and 1160 of the circuit 1100. For instance, when the enable signal islow, and the select bits are 011, the pass transistors 1165 a, 1170 a,1175 a, and 1165 b, 1170 b, and 1175 b turn on to pass the 6 and 6 inputsignals to the intermediate output nodes 1155 and 1160.

In some embodiments, the select signals S2, S1, and S0 as well as theenable signal are a set of configuration data stored in configurationdata storage of the IC. In some embodiments, the configuration datastorage stores multiple configuration data sets. The multipleconfiguration data sets define the operation of the storage elementsduring differing clock cycles, where the clock cycles of someembodiments include user design clock cycles or sub-cycles of a userdesign clock cycle of a reconfigurable IC. Circuitry for retrieving aset of configuration data bits from configuration data storage isdisclosed in U.S. patent application Ser. No. 11/081,859.

The pull-up PMOS transistors 1125 and 1130 are used to pull-up quicklythe intermediate output nodes 1155 and 1160, and to regenerate thevoltage levels at the nodes that have been degenerated by the NMOSthreshold drops, when these nodes need to be at a high voltage. In otherwords, these pull-up transistors are used because the NMOS passtransistors are slower than PMOS transistors in pulling a node to a highvoltage. Thus, for instance, when the 6^(th) input signal is high, theenable signal is low, and the select bits are 011, the pass transistors1165-1175 start to pull node 1155 high and to push node 1160 low. Thelow voltage on node 1160, in turn, turns on the pull-up transistor 1125,which, in turn, accelerates the pull-up of node 1155.

The output buffer inverters 1135 and 1140 are used to isolate thecircuit 1100 from its load. Alternatively, these buffers may be formedby more than one inverter, but the feedback is taken from an invertingnode. The outputs of these buffers are the final output 1180 and 1185 ofthe multiplexer/latch circuit 1100. It should be noted that, in analternative implementation, the output buffers 1135 and 1140 arefollowed by multiple inverters.

The output of each buffer 1135 or 1140 is cross-coupling to the input ofthe other buffer through a cross-coupling NMOS transistor 1145 or 1150.These NMOS transistors are driven by the enable signal. Whenever theenable signal is low, the cross-coupling transistors are off, and hencethe output of each buffer 1135 or 1140 is not cross-coupling with theinput of the other buffer. Alternatively, when the enable signal ishigh, the cross-coupling transistors are ON, which cause them tocross-couple the output of each buffer 1135 or 1140 to the input of theother buffer. This cross-coupling causes the output buffers 1135 and1140 to hold the value at the output nodes 1180 and 1185 at their valuesright before the enable signal went active. Also, when the enable signalgoes active, the signal that drives the third set 1120 of passtransistors (i.e., the “AND'ing” of the complement of the enable signaland the first select bit S0) goes low, which, in turn, turns off thethird pass-transistor set 1120 and thereby turns off the multiplexingoperation of the multiplexer/latch circuit 1100.

In FIG. 11, the transistors 1145 and 1150 are cross-coupled at theoutput stage of the routing circuit. Alternatively, as illustrated inFIG. 12, some embodiments place the cross-coupled transistors 1145 and1150 in the routing fabric to establish a configurable storage elementwithin the routing fabric outside of the routing multiplexer (such asmultiplexer 1100). In FIG. 12, the routing multiplexer 1250 of someembodiments comprises sections 1105, 1110, 1115, and 1120 of FIG. 11. Inorder to isolate the signal within the storage element 1210 of therouting fabric, some embodiments place isolation devices 1220 within orimmediately before the storage element 1210. The isolation devicesprevent the input signals to the storage element 1210 from convergingwith the signals passing through the cross-coupled transistors 1145 and1150 of the storage element 1210 when the enable signal is asserted.Therefore, when the enable signal is asserted, the isolation devices1220 prevent further input signals from entering the storage element1210. Moreover, the asserted enable signal causes the cross coupledtransistors 1145 and 1150 to store the signal currently passing throughthe storage element 1210. Furthermore, a pair of level restorers 1230are used to quickly restore degraded high levels passing into thestorage element 1210 and to prevent leakage in the inverters 1240 thatthe level restorers are driving.

In some embodiments (e.g., some embodiments that are not runtimereconfigurable), the latch enable signal of FIGS. 10, 11, or 12(referred to as Latch Enable in FIG. 10 and ENABLE in FIGS. 11 and 12)is one configuration data bit for all clock cycles. In other embodiments(e.g., some embodiments that are runtime reconfigurable), this enablesignal corresponds to multiple configuration data sets, with each setdefining the operation of the storage elements 1005, 1190, and 1210during differing clock cycles. These differing clock cycles might bedifferent user design clock cycles, or different sub-cycles of a userdesign clock cycle or some other clock cycle.

In FIGS. 10 and 11, the operations of the multiplexers 1010 and1105-1120 are controlled by configuration data retrieved fromconfiguration data storage. In some embodiments (e.g., some embodimentsthat are not runtime reconfigurable), the configuration data for eachmultiplexer is one configuration data set for all clock cycles. In otherembodiments (e.g., some embodiments that are runtime reconfigurable),this configuration data corresponds to multiple configuration data sets,with each set defining the operation of the multiplexer during differingclock cycles, which might be different user design clock cycles, ordifferent sub-cycles of a user design clock cycle or some other clockcycle. U.S. patent application Ser. No. 11/081,859 discloses circuitryfor retrieving configuration data sets from configuration data storagein order to control the operation of interconnects and storage elements.

Other embodiments might construct the storage element 1210 differently(e.g., the storage element 1210 might not use isolation devices 1220and/or the level restorers 1230). Some embodiments might also use analternative circuit structure for defining storage elements outside ofRMUXs in the routing fabric.

B. Storage Via a Parallel Distributed Path

In different embodiments, storage elements can be defined at differentlocation in the routing fabric. FIGS. 13-29 illustrate several examples,though one of ordinary skill in the art will realize that it is, ofcourse, not possible to describe every conceivable combination ofcomponents or methodologies for different embodiments of the invention.One of ordinary skill in the art will recognize that many furthercombinations and permutations of the invention are possible.

FIG. 13A presents one exemplary embodiment of a routing fabric section1300 that performs routing and storage operations by distributing anoutput signal of a routing circuit 1310 through a parallel distributedpath (PDP) to a first input of a destination 1340, which in someembodiments might be (1) an input-select circuit for a logic circuit,(2) a routing circuit, or (3) some other type of circuit. The PDPincludes a first path and a second path. In some embodiments, the firstpath 1320 of the PDP directly connects the output of the routing circuit1310 to the destination 1340 (i.e., the first path 1320 is a directconnection that routes the output of the routing circuit directly to thedestination 1340).

In some embodiments, the second parallel path 1325 runs in parallel withthe first path 1320 and passes the output of the routing circuit 1310through a controllable storage element 1305, where the output may beoptionally stored (e.g., when the storage element 1305 is enabled)before reaching a second input of the destination 1340. In someembodiments, the connection between the circuit 1310 and storage element1305 and the connection between the storage element 1305 and the circuit1340 are direct connections.

As mentioned above, a direct connection is established through acombination of one or more wire segments and/or one or more vias. Insome of these embodiments, a direct connection might include interveningnon-configurable circuits, such as (1) intervening buffer circuits insome embodiments, (2) intervening non-buffer, non-configurable circuitsin other embodiments, or (3) a combination of such buffer and non-buffercircuits in yet other embodiments. In some embodiments, one or more ofthe connections between circuits 1310, 1305 and 1340 are configurableconnection.

Because of the second parallel path, the routing circuit 1310 of FIG.13A is used for only one clock cycle to pass the output into thecontrollable storage element 1305. Therefore, storage can be providedfor during the same clock cycle in which the routing operation occurs.Moreover, the PDP allows the output stage of the routing circuit 1310 toremain free to perform routing operations in subsequent clock cycleswhile storage occurs.

Some embodiments require the second parallel path of a PDP to reach(i.e., connect) to every destination that the first parallel path of thePDP reaches (i.e., connects). Some of these embodiments allow, however,the second parallel path to reach (i.e., to connect) destinations thatare not reached (i.e., that are not connected to) by the first parallelpath. FIG. 13B illustrates an example of this concept.

In FIG. 13B, the first path 1320 and the second path 1325 of the PDPconnect to the destination 1340. Additionally, the second path 1325connects (e.g., directly connects in some embodiments while configurablyconnecting in other embodiments) to an alternate destination 1350. Thisadditional connection to the destination 1350 permits the storageelement 1305 within the second path 1325 to provide storage for multipledestination circuits 1340 and 1350 without restricting the functionalityof the source circuit 1310 or the multiple destination circuits 1340 and1350. Moreover, the stored signal can be distributed to multipledestination circuits at different clock cycles without having tore-store the signal or store the signal at a different location. Forexample, path 1325 of FIG. 13B routes the signal within storage element1305 to destinations 1340 and 1350 at a first clock cycle. During thisfirst clock cycle, destination 1340 may elect to receive the signalwhile destination 1350 ignores the input from path 1325 until it isready to process the signal at a second clock cycle. The storage element1305 can nevertheless continue storing the signal until the second clockcycle at which time the destination 1350 receives the signal.

The controllable storage elements 1305 of FIGS. 13A and 13B controllablystore the value output from the routing circuit 1310. When the storageelement 1305 is enabled (e.g., receives a high enable signal) by the setof configuration data 1330, the storage element 1305 stores the outputof the routing circuit 1310. Storage may occur for multiple subsequentclock cycles as determined by the set of configuration data 1330. Duringstorage, the output path of the routing circuit 1310 remainsunrestricted, therefore permitting the routing fabric section 1300 tosimultaneously perform routing and storage operations. For instance, ata first clock cycle, the configuration data sets of the circuits 1305and 1310 cause the routing circuit 1310 to output one of its inputs andcause the storage element 1305 to store this output of the routingcircuit 1310. At a second clock cycle, the set of configuration data1330 can cause the routing circuit 1310 to output another value from thesame or different input than the input used in the first clock cycle,while the storage element 1305 continues storing the previous output.The output of the routing circuit 1310 generated during the second clockcycle is then routed to the destination 1340 via the first output path1320.

In some embodiments, the configuration data set 1330 for the storageelement 1305 come at least partly from configuration data storage of theIC. In some embodiments (e.g., some embodiments that are not runtimereconfigurable), the configuration data storage stores one configurationdata set (e.g., one bit or more than one bit) for all clock cycles. Inother embodiments (e.g., embodiments that are runtime reconfigurable andhave runtime reconfigurable circuits), the configuration data storage1330 stores multiple configuration data sets, with each set defining theoperation of the storage element during differing clock cycles. Thesediffering clock cycles might be different user design clock cycles, ordifferent sub-cycles of a user design clock cycle or some other clockcycle.

As shown in FIGS. 13A and 13B, the routing operations of the routingcircuit 1310 are controlled by configuration data. In some embodiments(e.g., some embodiments that are not runtime reconfigurable), thisconfiguration data is one configuration data set for all clock cycles.However, in other embodiments (e.g., some embodiments that are runtimereconfigurable circuits), the configuration data includes multipleconfiguration data sets, each set for defining the operation of therouting circuit 1310 during different clock cycles. The different clockcycles might be different user design clock cycles, or differentsub-cycles of a user design clock cycle or some other clock cycle. U.S.patent application Ser. No. 11/081,859 discloses circuitry forretrieving configuration data sets from configuration data storage inorder to control the operation of interconnects and storage elements.

FIGS. 14 and 15 present an implementation of the routing fabric section1300 with the direct connections of the parallel distributed path ofsome embodiments. As shown in FIG. 14, the parallel distributed outputpaths 1320 and 1325 from the routing circuit 1310 are generated by firstpassing the output of the routing circuit 1310 through a series ofinverters. In some embodiments, some or all of these inverters 1410 and1420 are separate from the routing circuit 1310. Alternatively, in someembodiments, some or all these inverters 1410 and 1420 are part of therouting circuit 1310 (e.g., are part of the output stage of the routingcircuit 1310).

In FIG. 14, the first path of the parallel distributed output 1320 isgenerated from the value of the second inverter 1420 which issubsequently routed to a destination. By passing the output of therouting circuit 1310 through a pair of inverters 1410 and 1420, thedestination receives the same output value it would have directlyreceived had the output of the routing circuit 1310 been directly routedto the destination. The second path of the parallel distributed output1325 is generated from the output of the first inverter 1410. In thismanner, the storage element 1305 receives the inverted output of therouting circuit 1310.

In some embodiments of the routing fabric section 1300 of FIGS. 13A and13B, the storage element 1305 may be implemented with any traditionalstorage element such as flip-flops, registers, latches, etc. However, inconjunction with FIG. 14, some embodiments must couple an inverter tothe storage element 1305 to restore the original output value of therouting circuit 1310 when outputting to the destination or otherdestinations through the second parallel path 1325. In other embodimentsof the routing fabric section 1300, instead of using traditional latchesfor the storage elements, some embodiments implement the storageelements using the CPL cross-coupling transistor implementation of FIG.11 or alternatively through a CMOS implementation.

FIGS. 15 and 16 illustrate one such CMOS implementation of the storageelement 1305 of FIG. 14. The storage element 1500 receives as input thesignal 1430 passing through the directly connected parallel path 1325with a source component and outputs the signal 1440 to the second pathdirectly connected to a destination component. The storage element 1305includes a pair of CMOS inverters 1520 and 1530 and a pair of tri-stateinverters 1510 and 1540, which, as further described below by referenceto FIGS. 16A and 16B, are controlled by an enable signal and itscomplement.

Inverters 1510, 1520, and 1530 are connected in series. When the enablesignal is high, the series of inverters 1510, 1520, and 1530 passthrough and invert the input from the parallel path 1325 after the inputhas passed through the inverter 1410 above. Upon output at the thirdinverter 1530, the original value of the multiplexer 1310 will have beenrestored. As shown in FIG. 13A, this restored original value will bepassed from the storage element 1305 and will continue along the secondparallel path 1325 until reaching destination 1340 or the multipledestinations 1340 and 1350 of FIG. 13B.

If the enable signal to the first tri-state inverter 1510 is low, thefirst tri-state inverter 1510 does not pass through and invert thesignal coming in from the second parallel path 1325. Instead, the firsttri-state inverter 1510 acts to isolate the storage element 1500 fromthe signal. FIG. 16A illustrates an example of a circuit implementationfor the first tri-state inverter 1510. The tri-state inverter 1510includes two NMOS transistors 1610, one which receives the input 1430and one which receives the enable signal. The tri-state inverter furtherincludes two PMOS transistors 1630, one which receives the input 1430and the other which receives the complement of the enable signal. InFIG. 16A, the tri-state inverter 1510 inverts the input 1430 when theenable signal is high and acts as an open circuit (e.g., open switch)when the enable signal is low.

FIG. 16B illustrates an example of a circuit implementation for thesecond tri-state inverter 1540. Unlike the first tri-state inverter1510, the second tri-state inverter 1540 is activated by a low enablesignal. By swapping the enable signal and the complement to the enablesignal, the second tri-state inverter 1540 has the oppositefunctionality to that of the first tri-state inverter 1510. Therefore,the second tri-state inverter 1540 acts as an open switch when theenable is high and acts as an inverter that sets up an invertingfeedback path between the output 1560 and input 1555 of the inverter1540 when the enable is low.

Moreover, because the inverter 1510 is not propagating the signal 1325when the signal is low, this coupling of invertors 1520 and 1540 createsa feedback path that stores a value within the circuit 1500 so long asthe enable signal remains low. During this time, the third inverter 1530will receive its input from the feedback path. Therefore, while theenable signal is low, the circuit 1500 will output at 1440 the valuestored within the feedback path to destination 1340 via the secondparallel path 1325.

Re-assertion of the enable signal (e.g., enable is high) stops theinverter 1540 from propagating the stored signal, effectively removingthe feedback path which causes the circuit 1500 to stop storing a value.Instead, a new value is input into the storage element 1500 via thefirst inverter 1510 which resumes signal propagation.

C. Storage Via a Feedback Path Connected in Series

In some embodiments, the routing fabric provides storage through storageelements located within a feedback path and/or at the output stage ofrouting circuits. For a particular routing circuit that connects aparticular source circuit to a particular destination circuit, theoutput of the particular routing circuit's storage element connects toan input of the destination circuit. When enabled, this storage holdsthe output of the source circuit for a particular duration (e.g., forone or more clock cycles). Typically, such a storage element is used tostore data for a relatively small amount of time as its storageoperation prevents the routing circuit from performing its routingoperation. Accordingly, at times, this storage element is referred tobelow as a short-term storage element.

In addition to placing a short-term storage element at the output stageof a routing circuit, some embodiments place a “long-term” storageelement in a feedback path between an output and input of the routingcircuit. Such a storage element is referred to as a long-term storageelement as it can be used to store data for a time duration that can berelatively long as the storage element does not disable the routingcircuit's routing operation. In other words, the placement of thestorage element in a feedback path of the routing circuit allows therouting circuit to continue performing its routing operations even whenthe storage element stores data. Moreover, by implementing the long termstorage within a feedback circuit, overall wire congestion needed forstorage within the routing fabric is reduced as only a single input isrequired at the destination to route an output signal or a previouslystored signal.

FIG. 17 illustrates an example of short and long term storage elements.The routing fabric section 1700 includes the short term configurablestorage element 1710 at the output stage of a source component 1740. Thesource 1740 is illustrated in FIG. 17 as an interconnect circuit (e.g.,a routing multiplexer or other routing circuit), though it should beapparent to one of ordinary skill in the art that the source 1740 mayinclude any configurable IC component which receives or distributessignals throughout the routing fabric. The second configurable storageelement, referred to as the long term storage, is implemented via thefeedback path 1730 which is connected in series to the short termstorage section 1710.

In some embodiments, the short term storage section 1710 operates in amanner similar to those described with respect to FIGS. 10 and 11. Theshort term storage 1710 receives an enable signal 1760. When the enablesignal 1760 is inactive, the circuit simply distributes the currentoutput to the destination 1750 and the feedback path 1730. In someembodiments, the connection from the short term storage 1710 to thedestination 1750 is a direct connection. When the enable signal 1760 isactive, the circuit acts as a latch that stores the current value andcontinually outputs that value so long as the enable signal 1760 remainsactive.

However, continued use of the short term storage 1710 causes the routingfabric section 1700 to perform only storage operations and thereforerestricts the routing functionality of the routing fabric section 1700.For example, storing a value within the short term storage 1710 forthree clock cycles prevents the routing circuit 1740 of the routingfabric section 1700 from performing routing operations for the later twoof the three clock cycles. Therefore, a second storage section 1720 isused for long term storage when storing a value for two or moresubsequent clock cycles.

The long term storage is implemented via the feedback path 1730 that isdirectly connected to the output of the short term storage element 1710.The feedback path 1730 routes the output of the routing circuit 1740through the controllable storage element 1720 which may store the outputbefore returning the output to the routing circuit 1740 through a seconddirect connection. The feedback path 1730 receives its input from theoutput of the short term storage 1710 which is directly distributed tothe destination 1750 at the same time that the output passes through thefeedback path 1730. By distributing the output of the routing circuit1740 through the feedback path 1730 which reenters the routing circuit1740, the storage element 1720 within the feedback path 1730 may storethe output value for several clock cycles without impeding the routingfunctionality of the routing fabric section 1700. The feedback paththerefore clears the routing path while simultaneously providing storageduring subsequent clock cycles.

As mentioned above, a direct connection is established through acombination of one or more wire segments and/or one or more vias. Insome of these embodiments, a direct connection might include interveningnon-configurable circuits, such as (1) intervening buffer circuits insome embodiments, (2) intervening non-buffer, non-configurable circuitsin other embodiments, or (3) a combination of such buffer and non-buffercircuits in yet other embodiments. In some embodiments, the feedbackpath 1730 includes a configurable connection (e.g., include aconfigurable connection between the long term storage 1720 and the inputof the circuit 1740).

In some embodiments, one configuration data set controls both the shortterm storage 1710 and the long term storage 1720 during each clock cycle(e.g., user-design clock cycle or sub-cycle). Accordingly, in theseembodiments, the long term storage 1720 stores the output value onlywhen the short term storage 1710 is not storing and vice versa. Forinstance, positive logic might enable the short term storage 1710 whilenegative logic might enable the long term storage 1720. By using oneconfiguration data set 1770 and its complement value, the total numberof configuration data needed to implement the storage elements of therouting fabric section is reduced. Moreover, it should be apparent toone of ordinary skill in the art that the configuration data set 1770 ofsome embodiments include different sets of configuration data to controleach storage element 1710 and 1720 (i.e., the configuration data neednot be shared between the storage elements 1710 and 1720). In some suchembodiments, the short and long term storage elements would not have tobe operated in a complementary manner in each cycle (i.e., one storageelement does not have to store a value during one cycle while the otherstorage element is transparent during that cycle, as both storageelements can be transparent or storing during any cycle).

In some embodiments, the configuration data set that control the short1710 and long 1720 term storage elements come at least partly fromconfiguration data storage of the IC. In some embodiments (e.g., someembodiments that are not runtime reconfigurable), the configuration datastorage stores one configuration data set (e.g., one bit or more thanone bit) for all clock cycles. In other embodiments (e.g., someembodiments that are runtime reconfigurable and have runtimereconfigurable circuits), the configuration data storage stores multipleconfiguration data sets, with each set defining the operation of thestorage elements 1710 and 1720 during a different clock cycle. Thedifferent clock cycles might be different user design clock cycles, ordifferent sub-cycles of a user design clock cycle or some other clockcycle.

As shown in FIG. 17, the routing operations of the routing circuit 1740are controlled by configuration data. In some embodiments (e.g., someembodiments that are not runtime reconfigurable), this configurationdata is one configuration data set for all clock cycles. However, inother embodiments (e.g., some embodiments that are runtimereconfigurable circuits), the configuration data includes multipleconfiguration data sets, each set for defining the operation of therouting circuit 1740 during different clock cycles. The different clockcycles might be different user design clock cycles, or differentsub-cycles of a user design clock cycle or some other clock cycle. U.S.patent application Ser. No. 11/081,859 discloses circuitry forretrieving configuration data sets from configuration data storage inorder to control the operation of interconnects and storage elements.

In the discussion below, multiple other embodiments (such as thoseillustrated in FIGS. 20, 24-27, and 29) are described which illustratetwo storage elements that are controlled from the same set ofconfiguration data. Like the embodiment illustrated in FIG. 17, theseother embodiments do not need to use one set of configuration data for apair of storage elements. Also, like the embodiment illustrated in FIG.17, the configuration data sets can include one or more bits for allcycles, or can include different bits for different clock cycles (e.g.,different configuration data sets for embodiments that are runtimereconfigurable and have runtime reconfigurable circuits).

FIG. 18 presents an embodiment for implementing the storagefunctionality of the routing fabric section 1700 of FIG. 17. As shown inthis figure, the circuit 1800 includes (1) a multiplexer 1810, (2) afirst pair of pull-up PMOS transistors 1820, (3) a first pair ofcross-coupling transistors 1830, (4) a first pair of inverting outputbuffers 1840, (5) an output pair of inverting output buffers 1845, (6) apair of NMOS pass gate transistors 1850, (7) a second pair of pull-upPMOS transistors 1855, (8) a second pair of cross-coupling transistors1860, and (9) a second pair of inverting output buffer 1870.

The sections 1880 and 1890 implement the short term storage and longterm storage elements of FIG. 17 using CPL implementation similar to theone discussed with respect to FIG. 11. The short term storage element1710 of FIG. 17 is implemented via the first pair of pull-up PMOStransistors 1820, the first pair of cross-coupling transistors 1830, andthe first pair of inverting output buffers 1840.

In some embodiments, the multiplexer 1810 is implemented in accordancewith circuit representation of FIG. 11 while omitting the set ofcross-coupled transistors 1145 and 1150 that provide storage at theoutput stage as well as the level restoring transistors 1125 and 1130.The multiplexer 1810 of such embodiments is formed by the four stages1105, 1110, 1115, and 1120 of FIG. 11. In such embodiments, the pull-upPMOS transistors 1820 are similar to the pull-up transistors 1125 and1130, as they are placed after stage 1120 of FIG. 11 and act as levelrestorers to quickly restore degraded high levels from the multiplexer1810 passing into the short term storage element 1880 and to preventleakage in the inverters 1840.

In some embodiments, the multiplexer 1810 internally includes the levelrestoring transistors 1820 to restore the output signal before passingthe values across the wire segments of the routing fabric. In otherembodiments, the multiplexer 1810 internally includes the PMOStransistors 1820, cross-coupled transistors 1830, and inverting buffers1840, like the multiplexer 1100 which internally includes the levelrestorers 1125 and 1130, cross-coupled transistors 1145 and 1150, andinverting buffers 1135 and 1140 of FIG. 11.

The long term storage element 1890 of some of these embodiments remainsseparate from the multiplexer 1810, while this storage element 1890 ispart of the multiplexer 1810 in other embodiments, as illustrated inFIG. 19. Specifically, FIG. 19 illustrates both the short and long termstorages 1880 and 1890 as part of the internal multiplexer structure1910.

The first pair of PMOS transistors 1820 receives the output of therouting circuit 1810 and its complementary value. As discussed above,the PMOS transistors 1820 regenerate the voltage levels that may havebeen degenerated by passing through the NMOS transistors at the outputstage of the multiplexer 1810 which results in a threshold drops. A lowvoltage on the complementary output of Mux_Out turns on the pull-uptransistor 1820 connected to the non-complementary Mux_Out, which inturn, accelerates the pull-up of the non-complementary Mux_Out anddrives those values to the positive rail. After passing through thepull-up transistors 1820, the outputs continue through the first pair ofinverting output buffers 1840, but also through the output pair ofinverting buffers 1845 which restore the output of the multiplexer toits original value.

When the enable bit is active (e.g., high in this example), the shortterm storage section 1880 will act as a latch storing a value. Theactive enable bit will cause the output inverters 1840 and the pair ofcross-coupling transistors 1830 to operate forming a pair ofcross-coupling inverters that hold and output the signal propagatingthrough the short term storage section 1880 prior to the enable bitbecoming active. The cross-coupling transistors 1830 cross-couple theoutput of each inverter buffer 1840 to the input of the other buffer.This cross-coupling causes the inverting buffers 1840 to hold the valueat the outputs 1875 right before the enable signal went active.

Similar to the implementation of FIG. 17, the same enable bitcontrolling the short term storage section 1880 also controls the longterm storage section 1890. The long term storage 1890 and short termstorage sections 1880 are comprised of the same components, namely apair of pass gate transistors 1850, a second pair of pull-up PMOStransistors 1855, a pair of cross-coupling transistors 1860, and a pairof inverting buffers 1870. One difference is that the long term storagesection 1890 receives its complementary set of inputs from thecomplementary set of outputs of the short term storage 1880. Anotherdifference is that the long term storage section 1890 routes itscomplimentary set of outputs back into the multiplexer 1810 as opposedto routing the outputs to some other destination 1875. As describedabove, by routing the outputs of the long term storage 1890 back intothe multiplexer 1810, a feedback path is created whereby a value maybestored for multiple clock cycles without impeding the routing operationsof the routing fabric section 1800.

Another difference is that the positive logic of the enable bit causesthe short term storage 1880 to perform storage operations while thenegative logic of the enable bit causes the long term storage 1890 toperform storage operations (e.g., when the enable signal is low, theoutput of the multiplexer 1810, to destination 1875, which goes throughthe short term storage element 1850 the long term storage latches thesignal at the output of the short term storage element 1850. Therefore,when the long term storage 1890 is performing storage operations, thepath through the short term storage 1880 remains clear for performingrouting operations.

It will be evident to one of ordinary skill in the art that the variouscomponents and functionality of FIGS. 19 and 18 may be implementeddifferently without diverging from the essence of the invention. Forexample, the cross-coupling storage elements 1880 and 1890 may bereplaced to include traditional D flip-flops.

D. Storage Via a Feedback Path Connected in Parallel

An alternative implementation of the routing fabric section of FIG. 17is the routing fabric section of FIG. 20. Similar to FIG. 17, FIG. 20presents an implementation of a routing fabric section 2000 in which ashort term storage section 2010 is connected to the output stage of arouting circuit 2040 and a long term storage section is in a feedbackpath 2030 between the output and input of the routing circuit 2040. Thestorage elements 2010 and 2020 are configurably controlled by the set ofconfiguration data 2070. In some embodiments, the storage elements 2010and 2020 share the same set of configuration data 2070, while in someother embodiments the storage elements 2010 and 2020 are controlled bydifferent sets of configuration data.

The difference between the routing fabric section 2000 and the routingfabric section 1700 is that the input to the feedback path 2030 does notpass through the short term storage section 2010. Rather, the feedbackpath 2030 is instead connected in parallel to the first output path ofthe routing circuit 2040. The output of the routing circuit 2040 istherefore distributed via two paths. This alternative approach allowsfor greater usage flexibility in the design of the routing fabric whilealso providing short and long term storage without the need to passthrough multiple storage elements. Therefore, storage can be achieved ina single clock operation.

In some embodiments of FIG. 20, the first output path of the routingcircuit 2040 directly connects to and passes through the short termstorage section 2010 en route to destination 2050. The second pathcontains a pair of direct connections. A first direct connectionconnects the output of the routing circuit 2040 to the input of thestorage element 2020. A second direct connection connects the output ofthe storage element 2020 back into the input of the routing circuit2040. In this manner, the direct connections of the second path createthe feedback path 2030 which returns the value of the routing circuit2040 back into the routing circuit 2040 without traversing the shortterm storage section 2010.

As mentioned above, a direct connection is established through acombination of one or more wire segments and/or one or more vias. Insome of these embodiments, a direct connection might include interveningnon-configurable circuits, such as (1) intervening buffer circuits insome embodiments, (2) intervening non-buffer, non-configurable circuitsin other embodiments, or (3) a combination of such buffer and non-buffercircuits in yet other embodiments. In some embodiments, the feedbackpath 2030 includes a configurable connection (e.g., include configurableconnection between the long term storage 2020 and the input of thecircuit 2040).

FIG. 21 presents an illustrative implementation of the routing fabricsection of FIG. 20. Similar to FIG. 18 above, FIG. 21 is a CPLimplementation of FIG. 20 including (1) a multiplexer 2110, (2) a firstpair of pull-up PMOS transistors 2120, (3) a first pair ofcross-coupling transistors 2130, (4) a first pair of inverting outputbuffers 2140, (5) a second pair of pull-up PMOS transistors 2150, (6) asecond pair of cross-coupling transistors 2160, (7) a second pair ofinverting output buffer 2170, and (8) a configuration data bit set(e.g., ENABLE and the complement of ENABLE) for controlling thecross-coupled transistors 2130 and 2160.

The short term storage section 2180 contains the first pair of pull-upPMOS 2120, the first pair of cross-coupling transistors 2130, and thefirst pair of inverting output buffers 2140. The first pair of PMOStransistors 2120 receives the output of the multiplexer 2110 and itscomplementary value. The PMOS transistors 2120 regenerate the voltagelevels that may have been degenerated by passing through NMOS thresholddrops at the output stage of the multiplexer 2110. A low voltage on thecomplementary output of Mux_Out turns on the pull-up transistor 2120connected to the non-complementary Mux_Out, which, in turn, acceleratesthe pull-up of the non-complementary Mux_Out. After passing through thepull-up transistors 2120, the outputs will continue through the firstpair of inverting output buffers 2140, before being output at terminals2175.

When the enable bit (e.g., configuration data set) is active, the shortterm storage section 2180 will act as a latch storing a value. Theactive enable bit will cause the output inverters 2140 and the pair ofcross-coupling transistors 2130 to operate forming a pair ofcross-coupling inverters that hold and output the signal propagatingthrough the short term storage section 2180 prior to the enable bitbecoming active. The cross-coupling transistors 2130 cross-couple theoutput of each inverter buffer 2140 to the input of the other buffer.This cross-coupling causes the inverting buffers 2140 to hold the valueat the outputs 2175 right before the enable signal went active.

The long term storage section 2190 is connected in parallel to theshort-term storage 2180. The parallel connection of the long termstorage 2190 requires the multiplexer 2110 to provide a parallel set ofoutputs. As illustrated in FIG. 21, the multiplexer 2110 outputs Mux_Outand its complement to the short term output 2180. Additionally,multiplexer 2110 outputs a parallel set of complementary outputs thatare provided along the wire segments 2155 and 2157.

FIG. 22 illustrates one implementation for the multiplexer 2110 of FIG.21, which generates parallel complementary set of outputs. Thismultiplexer is similar to the first four stages 1105, 1110, 1115, and1120 of multiplexer 1110 except that in FIG. 22, the parallelcomplementary outputs 2155 and 2157 are generated by introducing twoadditional pairs of NMOS pass gate transistors 2210 and 2220 which areactivated using the select bit S0 in conjunction with the EN signal. Theoutputs 2155 and 2157 are then passed into the long term storage section2190 which includes the same components as the short term storagesection 2180.

Moreover, the long term storage 2190 performs storage operations byusing the complementary value of the enable signal described above withreference to the short term storage 2180. Therefore, when the short termstorage 2180 is inactive and acts only to propagate the complementaryset of outputs of the multiplexer 2110, the long term storage is enabledand stores a parallel set of complementary outputs of the multiplexer2110 using the second pair of cross-coupling transistors 2160. Byrouting the outputs of the long term storage 2190 back into the routingcircuit 2110, a feedback path is created whereby a value maybe storedfor multiple clock cycles without impeding the routing operations of therouting circuit 2110. After passing through the controllable storageelement in the feedback path, the signals are re-routed back into theinputs 2175 and 2177 of multiplexer 2110.

In some embodiments, the configuration data controlling the short 2180and long 2190 term storage elements come at least partly fromconfiguration data storage of the IC. In some embodiments (e.g.,embodiments that are not runtime reconfigurable), the configuration datastorage stores one configuration data set for all clock cycles. In otherembodiments (e.g., embodiments that are runtime reconfigurable), theconfiguration data storage stores multiple configuration data sets, witheach set defining the operation of the storage elements 2180 and 2190during different clock cycles. The different clock cycles might bedifferent user design clock cycles, or different sub-cycles of a userdesign clock cycle or some other clock cycle.

As shown in FIG. 21, the routing operations of the routing circuit 2110are controlled by configuration data. In some embodiments (e.g., someembodiments that are not runtime reconfigurable), this configurationdata is one configuration data set for all clock cycles. However, inother embodiments (e.g., some embodiments that are runtimereconfigurable circuits), the configuration data includes multipleconfiguration data sets, each set for defining the operation of therouting circuit 2110 during different clock cycles. The different clockcycles might be different user design clock cycles, or differentsub-cycles of a user design clock cycle or some other clock cycle. U.S.patent application Ser. No. 11/081,859 discloses circuitry forretrieving configuration data sets from configuration data storage inorder to control the operation of interconnects and storage elements.

In some embodiments, the multiplexer 2110 not only includes the circuitsillustrated in FIG. 22, but also internally includes the level restorers2120 to restore the output signal before passing the values across thewire segments of the routing fabric. In other embodiments, themultiplexer 2110 internally includes the PMOS transistors 2120,cross-coupled transistors 2130, and inverting buffers 2140, like themultiplexer 1100 which internally includes the level restorers 1125 and1130, cross-coupled transistors 1145 and 1150, and inverting buffers1135 and 1140 of FIG. 11.

The long term storage element 2190 of some of these embodiments remainsseparate from the multiplexer 2110, while this storage element 2190 ispart of the multiplexer 2110 in other embodiments, as illustrated inFIG. 23. Specifically, FIG. 23 illustrates both the short and long termstorages 2180 and 2190 as part of the internal multiplexer structure2310. It will be evident to one of ordinary skill in the art that thevarious components and functionality of FIGS. 23 and 21 may beimplemented differently without diverging from the essence of theinvention.

FIG. 24A presents an alternative embodiment to FIG. 17 in which theoutput of the multiplexer 2440 is passed to a short term storage element2405 before passing to the destination 2460 and the feedback loop 2420where the output may alternatively appear at a destination 2465. In thismanner, the output from multiplexer 2440 can be stored in one section ofthe routing fabric (e.g. storage element 2430) and appear at adestination 2465 along a different portion of the routing fabric. Insome embodiments, the connections between the storage element 2405 andthe destination 2460, between the storage element 2405 and the storageelement 2430, and between the storage element 2430 and the destination2465 are direct connections. However, in some embodiments, some of theconnections are configurable connections (e.g., the connection betweenstorage element 2430 and destination 2465 might be configurable).

Moreover, because the embodiment of FIG. 24A does not include theparallel distributed path of FIGS. 13A and 13B, this embodiment is nolonger restricted to routing the same signal along multiple paths. Forexample, in FIG. 13B, when the source circuit 1310 routes a signal todestination 1340 along wire segment 1320, the parallel distributed pathwould require the signal to similarly pass through wire segments 1325.Using some embodiments of FIG. 24A, a signal passes from source circuit2440 to destination 2460 without having to pass an additional signalfrom the feedback loop back to destination 2460. Rather, in theseembodiments the signal may pass to the destination 2460 along one pathand an alternate destination 2465 along another (e.g., where thealternate path includes the feedback path 2420).

FIG. 24B presents still another embodiment of the routing fabric section2000 of FIG. 20. In this figure, a first parallel output path ofmultiplexer 2440 is routed to a first destination 2460. The secondparallel output path 2470 of multiplexer 2440 is routed through thefeedback path 2470 back into the multiplexer 2440 and alternatively to asecond destination 2465. In this manner, multiple destinations 2460 and2465 can receive a stored value of a single source 2440. Moreover, thesame term storage element 2430 can store different values of the source2440 for processing by different destinations 2460 and 2465 at differentclock cycles. For instance, at a first clock cycle, the storage element2430 stores a value for destination 2460 and feeds that stored value todestination 2460 at a second clock cycle. At a third clock cycle, thestorage element 2430 can alternatively store a value for destination2465 which receives the stored value at the fourth clock cycle.

In some embodiments, the connections in FIG. 24B between the storageelement 2405 and the destination 2460, between the routing circuit 2440and the storage element 2430, and between the storage element 2430 andthe destination 2465 are direct connections. However, it should beapparent to one of ordinary skill in the art that in some embodiments,some of the connections are configurable connections. For example, theconnections between the storage element 2405 and the destination 2460,between the storage element 2430 and the destination 2465, or both areconfigurable connections.

In FIG. 24B, the storage element 2430 was illustrated within thefeedback path 2470. Alternatively, as illustrated in FIG. 25, someembodiments locate the storage element 2530 at the output stage of therouting circuit 2540, similar to the first storage element 2505. In someembodiments of FIG. 25, the connection between the storage element 2505and the destination circuit 2560 and the connection between the storageelement 2530 and the routing circuits 2540 and 2565 are directconnections. However, in some embodiments, some of these connections areconfigurable connections. For instance, the connection between thestorage element 2505 and the destination circuit 2560 the connectionbetween the storage element 2530 and the destination circuit 2565, orboth are configurable.

In some embodiments, the storage elements 2405 and 2430 of FIGS. 24A and24B and the storage elements 2505 and 2530 of FIG. 25 share the same setof configuration data, while in some other embodiments the storageelements are controlled by different sets of configuration data. In someembodiments, the configuration data sets that control the storageelements of FIGS. 24A, 24B, and 25 come at least partly fromconfiguration data storage of the IC. In some embodiments (e.g., someembodiments that are not runtime reconfigurable), the configuration datastorage stores one configuration data set (e.g., one bit or more thanone bit) for all clock cycles. In other embodiments (e.g., embodimentsthat are runtime reconfigurable and have runtime reconfigurablecircuits), the configuration data storage stores multiple configurationdata sets, with each set defining the operation of the storage elementsduring different clock cycles. The different clock cycles might bedifferent user design clock cycles, or different sub-cycles of a userdesign clock cycle or some other clock cycle,

As shown in FIGS. 24A, 24B, and 25, the routing operations of therouting circuits are controlled by configuration data. In someembodiments (e.g., some embodiments that are not runtimereconfigurable), this configuration data is one configuration data setfor all clock cycles. However, in other embodiments (e.g., someembodiments that are runtime reconfigurable circuits), the configurationdata includes multiple configuration data sets, each set for definingthe operation of the routing circuits of FIGS. 24A, 24B, and 25 duringdifferent clock cycles. The different clock cycles might be differentuser design clock cycles, or different sub-cycles of a user design clockcycle or some other clock cycle. U.S. patent application Ser. No.11/081,859 discloses circuitry for retrieving configuration data setsfrom configuration data storage in order to control the operation ofinterconnects and storage elements.

In some embodiments, the storage elements 2505 and 2530 are either bothlocated within the routing circuit 2540 or alternatively one storageelement is located at the output stage of the routing circuit 2540 whilethe other storage element is an internal component of the circuit 2540.It should be apparent to one of ordinary skill in the art that in someembodiments the feedback paths of FIGS. 24A, 24B, and 25 need not routeto both the multiplexer (2440 or 2540) and a second destination (2465 or2565). In some such embodiments, the output of storage elements 2430 or2530 are routed only to the respective destination 2465 or 2565 and notback into the multiplexer 2440 or 2540.

FIG. 26 presents yet another embodiment of some invention. In FIG. 26,the feedback path 2570 and the parallel set of outputs from the routingcircuit 2540 of FIG. 25 are removed. Instead, a single output from themultiplexer 2640 is distributed in two parallel paths. Each pathcontains a storage element 2605 and 2630, however neither path is aprimary signal path. The output from the first storage element 2605 isdirectly connected 2610 to a first destination circuit 2660 and theoutput from the second storage element 2630 is directly connected 2670to a second destination circuit 2665. However, one of ordinary skill inthe art will recognize that in some cases the two parallel paths mightnot end at the two destinations 2660 and 2665, but instead at a singledestination circuit. In this manner, the circuit resembles the circuitsof FIGS. 13A and 13B, though the inclusion of the second storage elementameliorates timing issues related to having a first path with a storageelement and a second path without a storage element.

As mentioned above, the direct connections of FIG. 24-26 may beestablished through a combination of one or more wire segments and/orone or more vias. In some of these embodiments, a direct connectionmight include intervening non-configurable circuits, such as (1)intervening buffer, non-configurable circuits in some embodiments, (2)intervening non-buffer circuits in other embodiments, or (3) acombination of such buffer and non-buffer circuits in yet otherembodiments. In some embodiments, one or more of the connections betweencircuits 2640, 2605, 2630, 2660, and 2665 are configurable connections.For instance the connection between storage element 2605 and thedestination 2660, storage element 2630 and the destination 2665, or bothcan be configurable.

In FIG. 26, the same set of configuration data 2650 is used to controlboth storage elements 2605 and 2630. In some embodiments, the storageelement 2605 latches when the set of configuration data 2650 is high andthe storage element 2630 latches when the set of configuration data 2650is low. In this manner, one path of the parallel distributed pathperforms storage operations and the other path routes signals to andfrom the source circuit 2640 to a destination 2660 or 2665. Therefore,the circuit of FIG. 26 transparently provides routing and storageoperations within the routing fabric. However, it should be apparent toone or ordinary skill in the art that some embodiments do not use thesame set of configuration data 2650 to control each storage element 2605and 2630.

In some embodiments, the configuration data sets that control thestorage elements of FIG. 26 come at least partly from configuration datastorage of the IC. In some embodiments (e.g., some embodiments that arenot runtime reconfigurable), the configuration data storage stores oneconfiguration data set (e.g., one bit or more than one bit) for allclock cycles. In other embodiments (e.g., some embodiments that areruntime reconfigurable and have runtime reconfigurable circuits), theconfiguration data storage stores multiple configuration data sets, witheach set defining the operation of the storage elements during differingclock cycles. These differing clock cycles might be different userdesign clock cycles, or different sub-cycles of a user design clockcycle or some other clock cycle.

As shown in FIG. 26, the routing operations of the routing circuit 2640are controlled by configuration data. In some embodiments (e.g., someembodiments that are not runtime reconfigurable), this configurationdata is one configuration data set for all clock cycles. However, inother embodiments (e.g., some embodiments that are runtimereconfigurable circuits), the configuration data includes multipleconfiguration data sets, each set for defining the operation of therouting circuit 2640 during different clock cycles. The different clockcycles might be different user design clock cycles, or differentsub-cycles of a user design clock cycle or some other clock cycle. U.S.patent application Ser. No. 11/081,859 discloses circuitry forretrieving configuration data sets from configuration data storage inorder to control the operation of interconnects and storage elements.

FIG. 26 is illustrated with a single path output from the multiplexer2640, though some embodiments of the circuit 2640 produce the parallelpaths directly from the circuit 2640. A first output of the paralleloutput path directly connects to storage element 2605 and a secondoutput of the parallel output path directly connects to the storageelement 2630. An implementation of such a multiplexer 2640 includes insome embodiments, the multiplexer 2110 of FIG. 21 where the second pairof parallel outputs 2155 and 2157 are directly connected to the secondstorage element 2190. However, in an implementation consistent with FIG.26, the outputs from the second storage element 2190 would be directlyconnected a second destination instead of feeding back into themultiplexer 2110. Moreover, in some embodiments of FIG. 26, the storageelements 2605 and 2630 are built into the output stage of themultiplexer 2640 similar to the storage elements 2180 and 2190 of FIG.23 without feeding back into the multiplexer 2640.

FIG. 27 conceptually illustrates how some embodiments of the inventionuse uncongested areas within the routing fabric to store data and toroute data to desired destinations. Some embodiments use the feedbackpath 2720 to provide values from the multiplexer 2740 to the storageelement 2730. However, the different destinations 2760 and 2765 may needdifferent values to be stored within the storage element 2730. Forinstance, at a first clock cycle, the output from source 2740 may needto be stored for three subsequent clock cycles before arriving atdestination 2765, therefore the value is stored in the storage element2730 located within the feedback path. During a second clock cycle, theoutput from source 2740 needs to be stored for two subsequent clockcycles before arriving at destination 2760. However, the first output iscurrently being stored within the storage element 2730.

In order to free the storage element 2730, but nevertheless provide longterm storage for the first output, some embodiments of FIG. 27 pass thefirst stored value within the storage element 2730 to an unused storageelement 2770 located elsewhere within the routing fabric. In thismanner, the storage element 2730 is now available to store the signaloutput from the multiplexer 2740 at the second clock cycle. So long asneither storage element 2730 or 2770 is needed during the third clockcycle, these storage elements continue storing their respective values.Then at the fourth clock cycle, the signal stored within storage element2770 is released and routed to destination 2765 and the signal storedwithin storage element 2730 is released and routed to destination

However, if the storage elements 2730 or 2770 are used for storing othersignals or the wire segments upon which the storage elements are locatedare used for routing other signals, then the storage elements 2730 or2770 may first pass the stored values to other unused storage elementselsewhere within the routing fabric. In this manner, the storage elementand the wiring path on which the storage element is located is freed andstorage is provided for at another unused storage element within therouting fabric.

In some embodiments, one or more of the connections between the variouscircuits illustrated in FIG. 27 are configurable connections. However,in some embodiments, the connections between the storage element 2705and the destination 2760, between the routing circuit 2740 and thestorage element 2730, between the storage element 2730 and the routingcircuit 2740, and between the storage element 2730 and the storageelement 2770 are direct connections. Additionally, in some embodiments,one or more of these direct connections are long offset directconnections. Such connections are further described below.

As indicated above, the connections between storage elements 2730 and2770 in FIG. 27 allow data to be stored while being routed to desiredlocations through uncongested areas of the routing fabric. FIG. 28conceptually illustrates an example of such storage and passing of astored signal from one storage element to another unused storage elementin order to free the storage element or the routing path on which thestorage element is located for use by other circuits of the IC. Forinstance, at a first clock cycle, a signal is passed from a sourcecircuit element 2740 to a storage element 2730 for long term storageuntil a fourth clock cycle at which point the signal is to arrive at adestination circuit element 2765. However, because the storage element2730 is required to store the value passed from an alternate circuitelement during a second clock cycle, the storage element 2730 releasesthe previously stored value and routes the value to a second unusedstorage element 2770. The storage element 2730 is now available toprovide storage at the second clock cycle for the alternate circuitelement.

At the third clock cycle, the wiring path on which the second storageelement 2770 is located is required to route signals from other circuitsof the IC. Therefore, the second storage element 2770 releases thestored value to a third unused storage element 2780 to provide storagefor the previously stored value during the third clock cycle. With thesecond storage element 2770 no longer providing storage, the path isclear for a signal to be routed from other circuits within the IC. Atthe fourth clock cycle, the stored value is routed from the thirdstorage element 2780 to the destination circuit 2765.

Such operations maximize the usage of the existing storage elementswithin the routing fabric without requiring additional storage elementsand also without congesting wiring paths which in some embodiments maybe required for routing other signals from other circuits of theconfigurable IC. Moreover, the circuit elements of the IC can continueto perform routing operations irrespective of whether storage forprevious values output from the circuit elements is being performedwithin the routing fabric. As noted above, in different embodiments, therouting fabric includes (1) a combination of wire segments, (2) acombination of wire segments and vias, (3) a combination of wiresegments, vias, and buffers, but no intervening configurableinterconnect circuits, or (4) a combination of wire segments, vias, andintervening non-configurable interconnect circuits.

Even though FIGS. 27 and 28 illustrate the concept of storing androuting data to desired locations through uncongested areas of therouting fabric by reference to the storage elements illustrated in FIG.27, other embodiments might use this same approach with other storageelements discussed above (e.g., with the storage elements illustrated inFIGS. 13, 17, 20, 24A, 25, and 26). Moreover, even through FIG. 27illustrates 2770 as a standalone storage element, this storage elementmight be at the output of another circuit, such as another configurableinterconnect. FIG. 29 illustrates one such example.

Specifically, FIG. 29 illustrates an alternative embodiment of FIG. 27in which the storage element 2770 of FIG. 27 is removed and insteadreplaced with a second short term 2920 and long term 2940 storagecircuit. Though the components and wiring between FIG. 29 and FIG. 27are similar, FIG. 29 illustrates a connection between such circuitswithin the routing fabric. By connecting two such circuits, the longterm storage capabilities of one circuit are expanded so that thecircuit can utilize unused storage elements of another circuit. One ofordinary skill in the art will recognize that even though FIG. 29illustrates two communicatively connected circuits, some embodimentsinclude several such circuits.

As described above, such functionality is necessary when a circuit mustprovide long term storage for multiple destinations at the same time.Therefore, if the storage element 2730 is already used but is needed toprovide long term storage for a different signal and/or destination ofcircuit 2740, then storage element 2730 may release the previouslystored value to the storage element 2940 provided that storage element2940 is unused. In this manner, signals originated from circuit 2740 arestored in the storage element 2730 within its own feedback path andstorage element 2940 within the feedback path of circuit 2910. Suchinterconnection between storage elements within different segments ofthe routing fabric makes available the storage resources of differentsegments of the routing fabric to circuits that otherwise would requireadditional storage elements within their own direct connection.

Though FIG. 29 has been illustrated with storage elements 2705, 2730,2920, and 2940, one of ordinary skill in the art will recognize thatseveral other variations are possible. For instance, these storageelements may be located in a manner similar to the storage elements 2605and 2630 of FIG. 26. Moreover, in some embodiments the storage elements2605 and 2630 may be included in addition to the existing storageelements of FIG. 27 or FIG. 29. In this manner the storage elements 2605and 2630 can work in tandem with storage elements 2730 and 2770 of FIG.27 or in tandem with the storage elements 2730 and 2920/2940 of FIG. 29.Similarly, instead of storage elements 2920 and 2940 after the routingcircuit 2910, the storage elements that precede the routing circuit 2910might be those of the PDP's illustrated in FIGS. 13-15.

In FIG. 29, all the connections are direct connections in someembodiments, while one or more of them are configurable connections inother embodiments. Moreover, some of the direction connections (e.g.,the connection between circuits 2730 and 2910) in this figure can beimplemented as direct long offset connections.

In some embodiments, direct long offset connections (also referred to aslong-offset direct connections) are direct connections between twonon-neighboring nodes that are not vertically or horizontally aligned.In some embodiments, the two nodes are two configurable circuits (e.g.,circuits 2730 and 2910), which in some of these embodiments the twocircuits are arranged in an array with other configurable circuits. Inother embodiments, the two nodes are two configurable tiles that includethe two directly connected circuits (e.g., the tile that includescircuit 2730 and the tile that includes the circuit 2910). In someembodiments, two nodes are not neighboring nodes when they are notadjacent to each other in the vertical, horizontal, or diagonaldirections. Accordingly, the two nodes that are connected by a directlong offset connection are two nodes that are not vertically orhorizontally aligned and that have at least one other node between them.

A direct long offset connection is a direct connection. As mentionedabove, a direct connection is established through a combination of oneor more wire segments and/or one or more vias. In some of theseembodiments, a direct connection might include interveningnon-configurable circuits, such as (1) intervening buffer circuits insome embodiments, (2) intervening non-buffer, non-configurable circuitsin other embodiments, or (3) a combination of such buffer and non-buffercircuits in yet other embodiments.

Even though direct long offset connections were described above byreference to FIGS. 27 and 29, one of ordinary skill will realize thatsuch connections can be used to implement the circuit structuresillustrated in some of the other figures. For example, some or all theconnections between the circuits mentioned above (e.g., between circuits1310 and 1340, 1305 and 1340, 2405 and 2460, 2430 and 2465, 2505 and2560, 2530 and 2565, 2605 and 2660, 2630 and 2665, and 2705 and 2760)may be implemented as long offset direct connections. Examples forimplementing long offset direct connections are described U.S. Pat. No.7,193,438. U.S. Pat. No. 7,193,438 is incorporated herein by reference.

While the above discussion has illustrated some embodiments of storageelements applicable to a configurable IC, it should be apparent to oneof ordinary skill in the art that some embodiments of the storageelements and routing circuits are similarly applicable to areconfigurable IC. Therein, some embodiments of the invention implementthe components within FIGS. 13A, 13B, 17, 24-27, and 29 with multiplesets of configuration data to operate on a sub-cycle reconfigurablebasis. For example, the storage elements for the sets of configurationdata in these figures (e.g., a set of memory cells, such as SRAM cells)can be modified to implement switching circuits in some embodiments. Theswitching circuits receive a larger set of configuration data that arestored internally within the storage elements of the switching circuits.The switching circuits are controlled by a set of reconfigurationsignals. Whenever the reconfiguration signals change, the switchingcircuits supply a different set of configuration data to the routingcircuits, such as the multiplexers and the selectively enabled storageelements within the routing fabric sections.

The sets of configuration data then determine the connection scheme thatthe routing circuits 1310, 1740, 2040, 2440, 2540, and 2740 of someembodiments use. Furthermore, the sets of configuration data determinethe set of storage elements for storing the output value of the routingcircuits. This modified set of switching circuits therefore adapts therouting fabric sections of FIGS. 13A, 13B, 17, 24-27, and 29 forperforming simultaneous routing and storage operations within asub-cycle reconfigurable IC.

While numerous storage element circuits have been described withreference to numerous specific details, one of ordinary skill in the artwill recognize that such circuits can be embodied in other specificforms without departing from the spirit of the invention. For instance,several embodiments were described above by reference to particularnumber of circuits, storage elements, inputs, outputs, bits, and bitlines. One of ordinary skill will realize that these elements aredifferent in different embodiments. For example, routing circuits andmultiplexers have been described with n logical inputs and only onelogical output, where n is greater than one. However, it should beapparent to one of ordinary skill in the art that the routing circuits,multiplexers, IMUXs, and other such circuits may include n logicalinputs and m logical outputs where m is greater than one.

Moreover, though storage elements have been described with reference torouting circuits (RMUXs), it will be apparent to one of ordinary skillin the art that the storage elements might equally have been describedwith reference to input-select multiplexers such as the interconnectcircuits (IMUXs) described above. Similarly, the routing circuitsillustrated in the figures, such as the 8-to-1 multiplexer of FIG. 11,may alternatively be described with reference to IMUXs.

The storage elements of some embodiments are state elements that canmaintain a state for one or more clock cycles (user-design clock cyclesor sub-cycles). Therefore, when storing a value, the storage elements ofsome embodiments output the stored value irrespective of the value atits input. Moreover, some embodiments have referred to the storageelements as “short term” or “long term” storage elements (e.g., thestorage elements 1710 and 1720 of FIG. 17), however, it should beapparent to one of ordinary skill in the art that such terminologydescribes one type of use for the storage elements. For instance, thestorage element 1710 need not store for only one clock cycle (e.g.,user-design clock or sub-cycle clock) or store for a short term.Similarly, the storage element 1720 need not be used only for long termstorage.

Moreover, even though some embodiments described above showed storagefunctionality at the output stage of the RMUXs, one of ordinary skill inthe art will recognize that such functionality can be placed within orat the input stage of the RMUXs or within or at the input stage ofIMUXs. Similarly, the source and destination circuits described withreference to the various figures can be implemented using IMUXs. Thus,one of ordinary skill in the art would understand that the invention isnot to be limited by the foregoing illustrative details.

V. Configurable IC and System

Some embodiments described above are implemented in configurable ICsthat can compute configurable combinational digital logic functions onsignals that are presented on the inputs of the configurable ICs. Insome embodiments, such computations are state-less computations (i.e.,do not depend on a previous state of a value). Some embodimentsdescribed above are implemented in configurable ICs that can perform acontinuous function. In these embodiments, the configurable IC canreceive a continuous function at its input, and in response, provide acontinuous output at one of its outputs.

FIG. 30 illustrates a portion of a configurable IC 3000 of someembodiments of the invention. As shown in this figure, this IC has aconfigurable circuit arrangement 3005 and I/O circuitry 3010. Theconfigurable circuit arrangement 3005 can include any of the abovedescribed circuits, storage elements, and routing fabric of someembodiments of the invention. The I/O circuitry 3010 is responsible forrouting data between the configurable nodes 3015 of the configurablecircuit arrangement 3005 and circuits outside of this arrangement (i.e.,circuits outside of the IC, or within the IC but outside of theconfigurable circuit arrangement 3005). As further described below, suchdata includes data that needs to be processed or passed along by theconfigurable nodes.

The data also includes in some embodiments a set of configuration datathat configures the nodes to perform particular operations. FIG. 31illustrates a more detailed example of this. Specifically, this figureillustrates a configuration data pool 3105 for the configurable IC 3000.This pool includes N configuration data sets (CDS). As shown in FIG. 31,the input/output circuitry 3010 of the configurable IC 3000 routesdifferent configuration data sets to different configurable nodes of theIC 3000. For instance, FIG. 31 illustrates configurable node 3145receiving configuration data sets 1, 3, and J through the I/O circuitry,while configurable node 3150 receives configuration data sets 3, K, andN−1 through the I/O circuitry. In some embodiments, the configurationdata sets are stored within each configurable node. Also, in someembodiments, a configurable node can store multiple configuration datasets for a configurable circuit within it so that this circuit canreconfigure quickly by changing to another configuration data set for aconfigurable circuit. In some embodiments, some configurable nodes storeonly one configuration data set, while other configurable nodes storemultiple such data sets for a configurable circuit.

A configurable IC of the invention can also include circuits other thana configurable circuit arrangement and I/O circuitry. For instance, FIG.32 illustrates a system on chip (“SoC”) implementation of a configurableIC 3200. This IC has a configurable block 3250, which includes aconfigurable circuit arrangement 3005 and I/O circuitry 3010 for thisarrangement. It also includes a processor 3215 outside of theconfigurable circuit arrangement, a memory 3220, and a bus 3210, whichconceptually represents all conductive paths between the processor 3215,memory 3220, and the configurable block 3250. As shown in FIG. 32, theIC 3200 couples to a bus 3230, which communicatively couples the IC toother circuits, such as an off-chip memory 3225. Bus 3230 conceptuallyrepresents all conductive paths between the components of the IC 3200.

This processor 3215 can read and write instructions and/or data from anon-chip memory 3220 or an offchip memory 3225. The processor 3215 canalso communicate with the configurable block 3250 through memory 3220and/or 3225 through buses 3210 and/or 3230. Similarly, the configurableblock can retrieve data from and supply data to memories 3220 and 3225through buses 3210 and 3230.

Instead of, or in conjunction with, the system on chip (“SoC”)implementation for a configurable IC, some embodiments might employ asystem in package (“SiP”) implementation for a configurable IC. FIG. 33illustrates one such SiP 3300. As shown in this figure, SiP 3300includes four ICs 3320, 3325, 3330, and 3335 that are stacked on top ofeach other on a substrate 3305. At least one of these ICs is aconfigurable IC that includes a configurable block, such as theconfigurable block 3250 of FIG. 32. Other ICs might be other circuits,such as processors, memory, etc.

As shown in FIG. 33, the IC communicatively connects to the substrate3305 (e.g., through wire bondings 3360). These wire bondings allow theICs 3320-3335 to communicate with each other without having to gooutside of the SiP 3300. In some embodiments, the ICs 3320-3335 might bedirectly wire-bonded to each other in order to facilitate communicationbetween these ICs. Instead of, or in conjunction with the wire bondings,some embodiments might use other mechanisms to communicatively couplethe ICs 3320-3335 to each other.

As further shown in FIG. 33, the SiP includes a ball grid array (“BGA”)3310 and a set of vias 3315. The BGA 3310 is a set of solder balls thatallows the SiP 3300 to be attached to a printed circuit board (“PCB”).Each via connects a solder ball in the BGA 3310 on the bottom of thesubstrate 3305, to a conductor on the top of the substrate 3305.

The conductors on the top of the substrate 3305 are electrically coupledto the ICs 3320-3335 through the wire bondings. Accordingly, the ICs3320-3335 can send and receive signals to and from circuits outside ofthe SiP 3300 through the wire bondings, the conductors on the top of thesubstrate 3305, the set of vias 3315, and the BGA 3310. Instead of aBGA, other embodiments might employ other structures (e.g., a pin gridarray) to connect a SiP to circuits outside of the SiP. As shown in FIG.33, a housing 3380 encapsulates the substrate 3305, the BGA 3310, theset of vias 3315, the ICs 3320-3335, the wire bondings to form the SiP3300. This and other SiP structures are further described in U.S. patentapplication Ser. No. 11/081,820 entitled “Programmable System InPackage”, which is incorporated herein by reference.

FIG. 34 conceptually illustrates a more detailed example of a computingsystem 3400 that has an IC 3405, which includes a configurable circuitarrangement with configurable circuits, storage elements, and routingfabric of some embodiments of the invention that were described above.The system 3400 can be a stand-alone computing or communication device,or it can be part of another electronic device. As shown in FIG. 34, thesystem 3400 not only includes the IC 3405, but also includes a bus 3410,a system memory 3415, a read-only memory 3420, a storage device 3425,input devices 3430, output devices 3435, and communication interface3440.

The bus 3410 collectively represents all system, peripheral, and chipsetinterconnects (including bus and non-bus interconnect structures) thatcommunicatively connect the numerous internal devices of the system3400. For instance, the bus 3410 communicatively connects the IC 3410with the read-only memory 3420, the system memory 3415, and thepermanent storage device 3425. The bus 3410 may be any of several typesof bus structure including a memory bus or memory controller, aperipheral bus, and a local bus using any of a variety of conventionalbus architectures. For instance, the bus 3410 architecture may includeany of the following standard architectures: PCI, PCI-Express, VESA,AGP, Microchannel, ISA and EISA, to name a few.

From these various memory units, the IC 3405 receives data forprocessing and configuration data for configuring the ICs configurablelogic and/or interconnect circuits. When the IC 3405 has a processor,the IC also retrieves from the various memory units instructions toexecute. The read-only-memory (ROM) 3420 stores static data andinstructions that are needed by the IC 3405 and other modules of thesystem 3400.

Some embodiments of the invention use a mass-storage device (such as amagnetic disk to read from or write to a removable disk or an opticaldisk for reading a CD-ROM disk or to read from or write to other opticalmedia) as the permanent storage device 3425. Other embodiments use aremovable storage device (such as a flash memory card or memory stick)as the permanent storage device. The drives and their associatedcomputer-readable media provide non-volatile storage of data, datastructures, computer-executable instructions, etc. for the system 3400.Although the description of computer-readable media above refers to ahard disk, a removable magnetic disk, and a CD, it should be appreciatedby those skilled in the art that other types of media which are readableby a computer, such as magnetic cassettes, digital video disks, and thelike, may also be used in the exemplary operating environment.

Like the storage device 3425, the system memory 3415 is a read-and-writememory device. However, unlike storage device 3425, the system memory isa volatile read-and-write memory, such as a random access memory.Typically, system memory 3415 may be found in the form of random accessmemory (RAM) modules such as SDRAM, DDR, RDRAM, and DDR-2. The systemmemory stores some of the set of instructions and data that theprocessor needs at runtime.

The bus 3410 also connects to the input and output devices 3430 and3435. The input devices enable the user to enter information into thesystem 3400. The input devices 3430 can include touch-sensitive screens,keys, buttons, keyboards, cursor-controllers, touch screen, joystick,scanner, microphone, etc. The output devices 3435 display the output ofthe system 3400. The output devices include printers and displaydevices, such as cathode ray tubes (CRT), liquid crystal displays (LCD),organic light emitting diodes (OLED), plasma, projection, etc.

Finally, as shown in FIG. 34, bus 3410 also couples system 3400 to otherdevices through a communication interface 3440. Examples of thecommunication interface include network adapters that connect to anetwork of computers, or wired or wireless transceivers forcommunicating with other devices. Through the communication interface3440, the system 3400 can be a part of a network of computers (such as alocal area network (“LAN”), a wide area network (“WAN”), or an Intranet)or a network of networks (such as the Internet). The communicationinterface 3440 may provide such connection using wireless techniques,including digital cellular telephone connection, Cellular Digital PacketData (CDPD) connection, digital satellite data connection or the like.

While the invention has been described with reference to numerousspecific details, one of ordinary skill in the art will recognize thatthe invention can be embodied in other specific forms without departingfrom the spirit of the invention. Thus, one of ordinary skill in the artwould understand that the invention is not to be limited by theforegoing illustrative details, but rather is to be defined by theappended claims.

1. An integrated circuit (IC) comprising: a) an interconnect circuit; b)a first storage element directly connected to said interconnect circuitfor storing in at least one cycle the output of the interconnect circuitfor at least one cycle; and c) a second storage element configurablyconnected to said first storage element for receiving an output of thefirst storage element and for storing said output for at least one cyclebefore supplying the output to a first destination circuit, wherein theoutput of the first storage element directly connects to a seconddestination circuit.
 2. The IC of claim 1, wherein the IC is aconfigurable IC comprising a plurality of configurable circuits.
 3. TheIC of claim 2 further comprising sets of configuration data forcontrolling the plurality of configurable circuits.
 4. The IC of claim3, wherein the sets of configuration data are stored in a configurationdata storage.
 5. The IC of claim 3, wherein the interconnect circuitcomprises an output stage with a third storage element.
 6. The IC ofclaim 5, wherein the output stage directly connects to the first storageelement.
 7. The IC of claim 1, wherein said cycle comprises auser-design clock cycle.
 8. The IC of claim 1, wherein said cyclecomprises a sub-cycle of a user-design clock cycle.
 9. The IC of claim1, wherein the interconnect circuit comprises an output stage and saidoutput stage comprises the first storage element.
 10. The IC of claim 1,wherein the first storage element is not part of the interconnectcircuit.
 11. The IC of claim 1, wherein the interconnect circuit is afirst interconnect circuit, the IC further comprising (i) a plurality ofconfigurable logic circuits for configurably performing a set of logicoperations based on configuration data sets and (ii) a plurality ofconfigurable interconnect circuits for configurably performing a set ofrouting operations based on configuration data sets, wherein theplurality of configurable interconnect circuits comprises the firstinterconnect circuit.
 12. The IC of claim 11 further comprising arouting fabric formed by a set of configurable interconnect circuitsthat includes the first interconnect circuit.
 13. The IC of claim 12,wherein the routing fabric comprises the first storage element and thesecond storage element.
 14. An electronic device comprising: anintegrated circuit (IC) comprising: a) an interconnect circuit; b) afirst storage element directly connected to said interconnect circuitfor storing in at least one cycle the output of the interconnect circuitfor at least one cycle; and c) a second storage element configurablyconnected to said first storage element for receiving an output of thefirst storage element, and for storing said output for at least onecycle before supplying the output to a first destination circuit,wherein the output of the first storage element directly connects to asecond destination circuit.
 15. The electronic device of claim 14,wherein the storage elements and interconnect circuit are configurable.16. The electronic device of claim 14 further comprising a routingfabric, wherein the routing fabric comprises at least one wire segmentused for establishing the direct connection.
 17. The electronic deviceof claim 16, wherein the routing fabric further comprises at least onevia used for establishing the direct connection.
 18. The electronicdevice of claim 16, wherein the routing fabric further comprises atleast one buffer circuit used for establishing the direct connection.19. The electronic device of claim 16, wherein the routing fabric doesnot include interconnect circuits.
 20. The electronic device of claim14, wherein the interconnect circuit comprises an output stage and saidoutput stage comprises the first storage element.
 21. The electronicdevice of claim 14, wherein the first storage element is not part of theinterconnect circuit.
 22. An integrated circuit (IC) comprising: a) aninterconnect circuit; b) a first storage element directly connected tosaid interconnect circuit for storing in at least one cycle the outputof the interconnect circuit for at least one cycle; and c) a secondstorage element configurably connected to said first storage element forreceiving an output of the first storage element and for storing saidoutput for at least one cycle before supplying said output to a firstdestination circuit, wherein an output of the second storage elementdirectly connects to the first destination circuit and to theinterconnect circuit.
 23. The IC of claim 22, wherein the interconnectcircuit is a first interconnect circuit, the IC further comprising (i) aplurality of configurable logic circuits for configurably performing aset of logic operations based on configuration data sets and (ii) aplurality of configurable interconnect circuits for configurablyperforming a set of routing operations based on configuration data sets,wherein the plurality of configurable interconnect circuits comprisesthe first interconnect circuit.
 24. The IC of claim 23 furthercomprising a routing fabric formed by a set of configurable interconnectcircuits that includes the first interconnect circuit.
 25. The IC ofclaim 24, wherein the routing fabric comprises the first storage elementand the second storage element.
 26. An integrated circuit (IC)comprising: a) a plurality of configurable circuits for configurablyperforming a plurality of operations based on configuration data sets,said configurable circuits comprising at least one particularconfigurable interconnect circuit comprising an output stage with afirst storage element; b) a plurality of configuration data storages forstoring a plurality of configuration data sets for defining sets ofoperations performed by the configurable circuits; c) a second storageelement directly connected to said output stage of the particularconfigurable interconnect circuit, said second storage element forstoring in at least one cycle the output of the particular configurableinterconnect circuit for at least one cycle; and d) a third storageelement configurably connected to said second storage element forreceiving an output of the second storage element and for storing saidoutput for at least one cycle before supplying the output to adestination circuit.
 27. An electronic device comprising: an integratedcircuit (IC) comprising: a) a plurality of configurable circuits forconfigurably performing a plurality of operations based on configurationdata sets, said configurable circuits comprising at least one particularconfigurable interconnect circuit comprising an output stage with afirst storage element; b) a plurality of configuration data storages forstoring a plurality of configuration data sets for defining sets ofoperations performed by the configurable circuits; c) a second storageelement directly connected to said output stage of the particularconfigurable interconnect circuit, said second storage element forstoring in at least one cycle the output of the particular configurableinterconnect circuit for at least one cycle; and d) a third storageelement configurably connected to said second storage element forreceiving an output of the second storage element and for storing saidoutput for at least one cycle before supplying the output to adestination circuit.
 28. An electronic device comprising: an integratedcircuit (IC) comprising: a) an interconnect circuit; b) a first storageelement directly connected to said interconnect circuit for storing inat least one cycle an output of the interconnect circuit for at leastone cycle; c) a second storage element configurably connected to saidfirst storage element for receiving an output of the first storageelement and for storing said output for at least one cycle beforesupplying the output to a destination circuit; and d) a routing fabriccomprising at least one wire segment and at least one buffer circuit forestablishing the direct connection.
 29. The electronic device of claim28, wherein the interconnect circuit is a first interconnect circuits,the IC further comprising (i) a plurality of configurable logic circuitsfor configurably performing a set of logic operations based onconfiguration data sets and (ii) a plurality of configurableinterconnect circuits for configurably performing a set of routingoperations based on configuration data sets, wherein the plurality ofconfigurable interconnect circuits comprises the first interconnectcircuit.
 30. The electronic device of claim 29, wherein the IC furthercomprises a routing fabric formed by a set of configurable interconnectcircuits that includes the first interconnect circuit.
 31. Theelectronic device of claim 30, wherein the IC further comprises therouting fabric comprises the first storage element and the secondstorage element.
 32. An electronic device comprising: an integratedcircuit (IC) comprising: a) an interconnect circuit; b) a first storageelement directly connected to said interconnect circuit for storing inat least one cycle an output of the interconnect circuit for at leastone cycle; c) a second storage element configurably connected to saidfirst storage element for receiving an output of the first storageelement and for storing said output for at least one cycle beforesupplying the output to a destination circuit; and d) a routing fabriccomprising at least one wire segment and at least one via forestablishing the direct connection.